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The Small Island: 10848bp (SEQ ID NO: 1) 

TTTTCGAAGGGGCGAGTTGCACAGGCCTAGGAAGGGGTGTTGCACCTTTTACTT6TCCAGCAGCCGTCTGGGCCGAACAT 
TAGGCCCGCATTGACGTGATGTCAATATGGAATAGCCCGCCTATTTAGGTGGGCTTTTCTGTTTTTCAGGCCCTGAAACC 
ATTCTGCAGGCGTAGCAATGTGCCCTCGACGTAATCAGCGATTACAGAGGGTACTGCGGTTTCGGCCTTGAAATTGCGGG 
GCTCTGCCAGCATGCCGGCGCTGGAATCCAGGCGCTCTCGGTAATTCCAGATTTTTCCTACAGGCCTAGCGAACTGCTGG 
GAGGCGCAGCTACCTTTCGTCCGGTAGCCGGCCCCTCAGTATCGGTATCGCTAGGCGAGACTGCTTCGGGCGGTAGATCC 
CCATCCTCCAGATCCGCTTGTTGCGGTGGAAGCCGCGGAGGCCGTGGCGGCACGGTTCGATAGGTGTTGAGCCATCAGAG 
TGAACAGCACGCTTTGCTCGAAGGTGTCAGGTGGCTGTGGTAGCGCTTCTTCCCACGCGGTTGCATCTCCCTGAATGCCT 
GTCCCTGGTGTGTGCTCCCTTGGTTCTGCAGGTTCTGTCCTAGCCTCCTGTGGCGGCGCTTCACCCATTCGAGCGAGAGG 
TTGCCCGCATGGCTTCGGTTTGGCAGAGGCTTGCGGGGAGCCGCTGTAATCCACCTTCCTGACGTTCTCCAGCGCTCCTT 
GGGCCAGGCTTCTGGCATGCAGCACGGCACGCATGTGGTCGTAGTCGGCCTCGATGTAGCTCATGGTGGTCTGGATATTC 
GAGTGGTTGAGCAGGCACTTCGTGAGGTGGATGTTCCGCTCGGGTGCCTTCATCAAGTCGGTGGCCAGGGTGTGCCGGAA 
ACGGTGCGGAGTCATCCGCACCCCAACCTTCTCGGTCAACTTCCGGTACATGGCTTCGACCTGGTCGGAGTTCATCACCT 
TG CTCTTGTAG TG CGG TG AG AACCGGTTG ACG TTG AAC AACTGGTCGTCATCGGCG AATCCGGCTCTATCGGCCTCCTGC 
AGGAGCCGCGATAGGTGAGGCACAAGCCCCTCCGTTATTGGCACTACGAACTCTTTATGAGTTTTCTCTGTCTCGCCGCG 
GATGAGGATCAGTTGATTTTCCCAGTCGATGTCGCGCTTGCGGATGCACAACAGCGCATTCAACCGGATGCCGGTGAAGT 
AGAAGACCTCAAACGTGCAAAGCCAGAACCAGGCGGGAGTGATGCGTGCGCGATCGCCAGTGCAGCGCTCGGCGCCGACC 
TGCATGTTGAGCCAATTGCGGGCGCGCAGGATGGCTTCGGCTGCGACGGTTTTGCTTGCTCGCCTGGGGGGGATGACGGT 
GGTCTTTCTGAACGGGTTGACTTGGGAGTGTGTCACCAGCTCGTGCTCGATGGCATAGCCCCAGATCGTTCGCAGATGAT 
TCGAATACGTGTTCCAGCTCCGCTTCGACAGGCCTTGTTCCAGGACCTTGCGCCGCCATCCCAGCACAGACCTGTGGTCC 
ACCTCCTGTACGGTTGCCGTAGGGCCGAAGTGCTTGAGCAGCGCCTTGGTCGCGGCGCGGTAGATCTTCGCGCTGGCTTC 
TCGGAGATCGTGCGCGAAGATGTACTCCTCGGTGAGCTGCTGCGGCGTCATTCCACACCTCCTTCGGCATCGGTGATGAC 
CGTGAGGCTTGGGTTGTCCAGAGGCTGCACAGGGAACAGCAATTTGGGATCCTGGAGCAGGTAGGCCTTGAGCTCTTTCG 
TCTTGCGAGGACCAGAAACCTTGATGGTCCAGATATTCAGGTTCTTACTGGTCTTTCGGTGAAGACCCTGTTTCTCGAAC 
GCGCGCTGCACCAGCTTCCAGCCGGCGGTCTCCTTGGCCTGGGCCAGCTTTTCAACCTCCGGATGCTCCTGGACATAACG 
CTTGAAGATTCCTGGCGTGACCAGCATGGCGGTCCCGTCGACGGTATGCACCAAAGCCTTGGTGTCGTTGATGAACAGGC 
GACGGGTCGCGATACCAGATTTCATCCAGCCAATGAATCCCTGTCCCAGATCAGTGCTTCTGCTAGGCATAAATACATCT 
TCAATTGCTTCAGGAGCGCAATCTGTAGTGGTCAACCCATCCCGGACAGTGGGTTGAGTTTTTCTTCGGCCACCGCAGGT 
GGTAACCCGTCGTTGAATTGATGCGGCCTGATCCAGTTGTAGCGGTGCATCAAGTAATGACTGATGTCCCGTTGGGCCTC 
CTGCGCCGTCAGGTAACCCGTTGACGGGACCCACTCCGACTTCAGACTGCGGAACAGGCGCTCCATCGGCGAGTTATCCC 
AGCAATTCCCCCGACGGCTCATGCTCTGCTGCATCCGATAGCGCCAGAGCCGTTGCCGAAACAGGCGGCTGGCGTACTGG 
CTGCCCTGGTCTGAATGGAACAGCACCTGCTGTGGCCTGCCGCGCTGTTCGTAGGCCATGTCCAGGGCCTTGATCACCAG 
TTCGGCATCCGGCTTGGCCGAGAACGCCCAGCCGATCAGCAGATCCAGCACCGCGGCCAGGTAGTGCCAACGGCCTTGCG 
CCCAGACGTAGGTGATGTCGCCACACCACACCTGTATGGGATGCTCGGTCGCGAATTCGCGGTTCAGCCGATTCGGGATA 
TCCGGCCGCTCAACCGTGGCCTGTTTGTAGGCGTGCGAGCCCGGTTGCTTGCTGACCAGGCCCAGCTCACGCATCAACCG 
ACGCACTCGGAAACGGCCGATGGTCACGCCCTCTTCGCGCAGCATGCCCAGAATGCTGCGGCTGCCGGCCGAGCCCCGAC 
TCTGGCTGAACAACTGGTTGACTTGGCTGCGTAGCGCCACGCGGCGAGCATCGACACGCCGCCGTCGAAGACGGTGGACG 
TAG TAG CAAG ACCGCG CCAC ATCG AAAG CTG AG CAG ACCACTTC CACCG ACTCCTGCTCACTCAACTGGTCTATCAGCGC 
GTACGATCGAGTTCGTCCGACATCAAGAGAGCGGTAGCCTTTTTTAATATCGCTTTCTCCCGCTCCAATCGGTTGATCCG 
GGCTTCCAGCTCCTGGATCTTTTGCTGCTCAGGCGTCAACGCCTTGCTCTTCGGGGTCACACCCTGGCGCTCCGCCTCGA 
GCTGCTTCACCCAACGGCGCAAGGCCGAATCCACCACCCCCAGCGAACGGCAGGCGTCGATATGGCTGTAGCCTTGGTCC 
AACACCAGGGCCGCGGCCTCTCGTTTGAACTCGGCGGAAAACGTACGTCGTTGCTTGCTCATCAGACACCTCTTTCACGG 
CGAGGATTCTCGCCTAAATCGGTGTCCGGGATCAGTAGACCACTACACTGCCGAAACAGTTATTTTTGTTGCTAAGAGGC 
AGGAAGGCTGCGCTGAGATTCCTACTAGTGAATTAGATAGTTGTTGTCTTCTGAAACAGAACTGAAGCGAAATTGGGGGT 
AGGGTTTTTCTAGGTGAAGGGTAACTCTTGAGAATTACAT^AGGTGTGACGCAAATGTTGTATTTTTCTTGCAGTATGAAG 
ATGGGTGGTTGGGTCGGATATAGGTACTTCTCTCTATTTTCTTTAATTGCTCTCATCTATGGGTGTGTCGGTGGTGGAGG 
TGGATCGGATGAGATTGGGCAGCACTGCTTTGAGAGAGAGCAAAAGCTTTCCGGAGTTAATGATAATGAAGAGGGGAGTG 
TGAGGTTGAATCGGCTGAACTGCGATCCAATTGAAGGTCGTGTTCTTGAATCAGAGAAGCTGATAAGAAAGCCGCCCAAT 
GAGCTGGGTATTCACTGAGTTAATAATTTTTAGCCTGGAGAGGTCTTTTTGAAAATGAAAAAATCACTTGTTATGTCX3GC 
TGTGCTTTTGGTGGCTAGCAATTTCGCGTGTGCTGATGAGGGCTCAAATGATGGAAGTGAGATATGTCGGGCGCAGGGTG 
GAGTTGAAATAACAAGTCTGGGGGAAGTCTCAAAGGGTGTGGATGTTGAAGATGTTGTAGTTTGTTCGATTCTTCCAAGT 
AATATGAAGTCGAGTCAAAGAGCGCCTACACTCCCTCCTCTGCAAAGGATGATCATTTCGGCAATGCCTTCACCAGGAAC 
GGTCACTGTTTCTGCCAGCGGAGATAGGAAATTTACAACATCTTGCCGGGCAAATCTTTATGCTCCACGTTATGCCAATT 
TCTATCCAGACGGTGTTAGCAGGGGAACATCAGATCTACGATGTGTTGGTTACAATACACCCGGGAATTCATCTCAAGGG 
TGTAATGTGTCATGGGACGGCCCGACCGACATTCAATTGGGTGTTGAGCCATATGGCGGATCTGTTGTTGTA7^ACTACAG 
TTGCACTGCATTCAAAACAACGATTCCAGTGATAATGAGCTACAGTTATCGTGATGGGCGGGCAGTGTATGGCGAGGTCC 
AGAATGTGTCAGGAATAATAAATGTGGTTTTGAACTAAGGATGCGCCTTAGTAGGATTTTATTTCAAGGTGTTCAGCTTG 
TGAATGTGCTGCAGGGCACTCTATAGAGTGCCCTGCATAGGCTGGAAGTCGTTTGGCTGAGGGCTAGTCTTTCGAGTTCG 
ATATGGCACGTTGATGAAGGCTGGTAAAAAGTCCTGTCTGTGTGGGTGGCAGGCGAGTGGCGCGTTCAGACGCTAATGTG 
GAGTTCAAGGTAGGGTTCCGACGCCGAGGATCCTCTTTGGGTGGAGTCATTTGCTGAGCCATGCAGTTATAAGACAGGCT 
ACGTGCTCCATTGACTTGGACATCGATACACGAAGATTCAGATTGTTCCGAGTGTGACTCTGCGGGCGGGGAGGCAGCAG 
CCTGCTGTGCCAAACCAACTATAGGCAACAAGAATATAATTCGAAGAATTTTAATAAGCATTCCAATCTCCTTAACCCTG 
AGGGTTGGATGAGACTCAATAGTAACCTCAAGTTATGAGGAAAGATGAACCGCCTATCGTTTTGTAGTCTTGGTTAGGTC 
AGG CACACG CTAGTG AGTAGTACTAAG AGT AAGC CG ATAG CCAGGGGGCG TGGTGGCCCATTTGGGG AAGTGATGAAG AG 
GTGCGGGCTTGTACCGGTTCGAGGAAGGAATAGACAGCAGACAGGATCGCTTGCGATGGGGCAGCAGGAAACCATCAGCC 
CGTCCGTATCCAGAACTGCTGCTTGCAGCGTTAGGGGTGACTCCCTCATGCCCTAGAGGGCATAATCCCTGTATGGAGAT 
GGCTAACTG ATATT AAGG AGTGGTCG AAG TCATGG AACG CTTGCTCGAGAGCATTTACATCAATGCCCGGCCGG CG ATGG 
AGTTGAGGCTTAGCCTCACCAGCTCCGGCCGCAAGAGAATGGTAAAGATTGTGGATGGGGAGGAGGTCGAGGTTCTGCCA 
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FIGURE IB 

GGTGAAGTGCAGGGCATCCTGGAGGCCCAAAAGAGGGATGTTGGAATCCTCGCCGACTTCTTAGCCAAGAGTCTCGTGGC 
G CG ACGCTAG AG AT CCTTGGCTAGCC AG TCTATCG CCG ACAG CAACGTGGTCTCGTCGGC C ATGCCTTCAAGCTTCACGC 
TTCGTCCTGTCTGGTTGTCCCGGATGACGAGGGTCGGGGTCGCTGTAATGCCCTTCGACTGTGCGGTGTCGATGTTCGTA 
AGCGTCCGGGGAAGGCACATTGCGATGCGCTAAGCTGTAATCCTTTAGGATTGAGGTCTCAGCATGGAATGCCACGTTCG 
TCCCGCCACGAGCAGAGATGCAGCAGCGATAAGCTGCGTAGTTATAGCCGCCCTGCGTGAGTCAAATTCACAGGACTATC 
CGCCTGATGTGATCGCTCAGGTTGAGCAGAGCTTTTCTCCTGAAGCCATCACCACACAGCTTACGAAGCGTAGGGTCTTC 
GTAGCCTTATTGGGCGAAAACATTATTGGCACTGCCGGTCTCGACGGTGACGTCGTCAGAAGTGTTTTCGTTGACCCAGC 
TCACCAGAAAGGCGGTATCGGGCGGCATTTGATGGATGTCATTCATACAACTGCTGCCAGCGCGGGAGTTGGAGCTGTAC 
GTGTGCCATCGTCGATTACAGCTGAAAGGTTTTATACCGCATTGGGTTATCAGAAAATCCGCGACGAGTTTCATGGGGCG 
GAGCGCACCATCGTTATGGAGAAGCGGCTGTAGGATTATTCCAGACTATTTACCCACTTATGCGGCAACAAATCAGTGAT 
CTCACTGGCCCGCTGCGTGGGCAGCCGCGTCAGCACATCTTTGAGATAGGCATACGGATCATGCCCATTCATACGCGCCG 
ACTAGATCAGGCTCATGATTGCAGCCGCCCGTTTACCACTGCGCAGCGACCCGGCAAACAGCCAGTTCGAACGCCCGAGC 
GCCCATGGCCGTATTTGGTTCTCGACCTGATTGTTATCGATGGGCACAGCCCCATCGTCCAGGTAGCGCGTCAGCGCTAC 
CCAGCGTTTCAGGCTGTAATCGAGGGCTTTGGCCGTGGCTGATCCATTGGGCACCAGGTCTCGCTGAGCCAACATCCAGT 
CATGCAGTTTTTTGAGGATCGGCACCGCCAATTCTTGTCGTATTCGCCAGCGCTCTTCATCACTCATGTCCCGCGCCTGA 
CGTTCG AC CT CGTAC AAG CCGCTG AT CG AGTG C AGGG CCTGTTCAGCCAGCTG ACTTTTGTTCGCCACGTGCAAATCAAA 
GAACTTGCGCCGGGCGTGGGCCATGCAGCCGATTTCAGTGATGCCTTGTTCGAAACCGGCTTTGTAGCCAGCGAAGTCGT 
CGCAGACCAGCTTGCCGTTCCAGTCACCCAGGAAGTTGCGCGCATGTTCGCCAGCACGGCTTGGGCTGAAGTCGTAAACC 
ACCGCTTTGAGCCCTGAAAACGGCGTCGTGCTGTACGCCCAGACATAGGCCCGGTGGGTTTTCTTCTCGCCTGGCGCAAG 
CATTTGCACCGGTGTTTCATCAGCGTGGATCACGCCCTGGTTCAGCACGGCTTCACGCAGTGCATCGACCAGTGGCTGAA 
G CCGCACGCCGGTTTGTCCG ACC C ACTGCGCCAGGGT CG AGCG AG CAATTG CC AGCC CGG CGCGG CCAAAGATTTTCTCC 
TGCCGGTACAGCGGCAAGTGATCGGCAAACTTGGCCACCATCACGTGGGCCAACAAACCTGCGGTCGGGATACCTTTGTC 
GATTGACCTGCCCCCCGATTTCAGTACCACGCTGAACTTAGTAGAATCCGTTTTCCAAGCAGGAGACGGCAGTGCAGAAG 
CGTTCTTCTCGACTTCCTCAAGCGGGTCGAGGCTAGTGTACCGACCGAGCTGCACGCCCAACTGGTGGAACTGGCCCAGG 
AGTATCGACGCTGTAGCTAACGCCGCCTGCACATCCTGCTGTGGCGTGCAGGTCAACCACAAGCGGATTTACCGGCTGTA 
CCGAGCCGCCGGCTTGATGGTGAGGCGGCGGAGGGTCGCGGTGGCGCGCGAATGCCTGAGCCTGTCGAGCGCACCGAACC 
AGGTCTTGTCGATGGATTTCGTCTTCGACGCGCTCAGCACTGGGCGACGGATCAAATGCCTGACGGTGGTCGATGACTTC 
ACCAAGGTGTCGGTCGACATCTTGGTGGAGTACGGTATCAGCGGTTTTCGTGTCACGCGGGCGCTGGACGAGATGGCGCG 
GTTTCGTGGCTACCCGCAGGCGATCCGCACCGACCAGGGCCCCGAGTTCACCGGCAAGGCGCTTGATCAGTGGGCCTGTC 
AGCGTGACATCAAGTTGAAGCTGATTCAGCCTGGCCAGCCCACGCAGAGCGCCTTCATCGAGTCATTCAACGGCAAGTTC 
CGGGGCGAATGCCTCAATGAGCACTGCTCGCTGGTCGAAGCCAGAATCCGTATCGCGGCTTGGCGGGATTACAACGAGCA 
CCGACCACACAGCGCCATTGGCAATCTCTCCCCGGCAGAGCTTGCTGCGAAGTGGCGAACCAACCAGCAGCAGCTGAAGC 
GGGAAAAGTTGATATCAACCCCATAGCCTACTAACTAGGCAGCGGTACTAAAACTGGGGGCAGGCCAGTCCGAACTGATA 
7^AAAAACGTCCGAAACTGGCAAAACCTTATGGCCGGTTTTCCTAATATTTGCTCCGAACCCTCGGTATTTCAAGGAGCGA 
ACATGCATATCCAATCGTTGGGGGCTACTGCCTCCTCGCTGAATCAGGAGCCTGTCGAAACCCCGTCGCAGGCAGCGCAT 
AAGTCCGCCAGCTTGCGTCAGGAACCTTCAGGGCAAGGTCTCGGGGTTGCCCTAAAGAGCACGCCGGGAATACTTTCCGG 
GAAGTTGCCGGAAAGCGTTAGCGACGTGCGTTTCAGCAGTCCCCAAGGGCAAGGGGAGTCCCGTACTCTGACTGACTCGG 
CAGGGCCGCGGCAGATCACTCTGCGCCAGTTTGAGAACGGAGTCACCGAGCTACAGCTCAGTCGGCCACCATTGACCAGT 
CTGGTCCTAAGCGGCGGTGGTGCCAAAGGTGCGGCATACCCGGGAGCAATGCTGGCGCTAGAAGAGAAAGGCATGCTCGA 
TGGCATCCGCAGCATGTCCGGTTCGTCCGCTGGCGGCATCACCGCCGCCCTTTTGGCCTCAGGTATGAGCCCGGCGGCGT 
TCAAGACCCTTTCCGACAAGATGGATCTTATTTCGCTGCTCGACAGCTCGAACAAGAAGCTGAAGCTGTTCCAACACATT 
AGCAGCGAGATCGGCGCATCGCTGAAAAAGGGCTTGGGCAACAAGATCGGCGGCTTCTCTGAGTTGCTGCTCAATGTACT 
CCCACGCATAGATTCGCGGGCTGAGCCCCTAGAACGCCTATTGCGCGACGAGACACGCAAGGCCGTGCTCGGACAGATCG 
CTACGCATCCAGAGGTTGCACGCCAGCCGACCGTTGCCGCCATCGCCAGCAGATTGCAGTCCGGCTCCGGAGTCACCTTT 
GGCGATCTAGATCGGTTGAGTGCTTACATTCCCCAGATTAAGACGCTGAACATCACAGGTACGGCCATGTTCGAGGGGCG 
TCCGCAATTAGTGGTGTTCAATGCCAGCCACACACCGGATCTGGAGGTCGCCCAGGCGGCACATATCTCCGGTTCCTTCC 
CAGGAGTGTTCCAGAAGGTCAGCTTGAGTGATCAGCCGTACCAGGCCGGCGTAGAGTGGACAGAATTCCAGGATGGCGGG 
GTGATGATTAACGTGCCGGTCCCTGAGATGATCGACAAGAATTTTGACAGCGGGCCACTGCGGCGCAACGACAACCTGAT 
CCTTGAGTTCGAGGGCGAAGCTGGGGAGGTAGCGCCCGACCGAGGTACTAGGGGCGGCGCGCTCAAGGGCTGGGTCGTCG 
GGGTG CCTG CCCTGCAGG CG CGCG AAATG CTG C AGCT CG AGGGCCTGGAGG AATTG CGCG AGCAAACCGTTGTGGTGCCG 
TTGAAGAGCGAGCGCGGTGATTTCAGTGGCATGCTCGGTGGCACCTTGAACTTCACCATGCCGGACGAGATCAAGGCGCA 
TCTTCAGGAGCGCCTCCAGGAGCGAGTCGGTGAACATCTGGAGAAACGTCTTCAGGCTTCAGAGCGTCATACCTTCX3CTT 
CTCTCGACGAGGCGCTGCTGGCACTTGATGACAGTATGCTCACCAGTGTTGCTCAACAGAACCCGGAGATCACAGACGGG 
GCGGTGGCTTTTCGCCAGAAGGCGCGGGATGCGTTCACCGAGCTGACTGTCGCTATCGTTAGCGCCAATGGCTTGGCGGG 
TAGGCTCAAGTTGGACGAGGCTATGCGCTCCGCTCTTCAGCGACTCGATGCGCTGGCAGATACTCCGGAACGCCTAGCAT 
GGTTGGCAGCTGAGTTGAACCATGCTGATAACGTTGATCATCAGCAGTTACTCGATGCCATGCGCGGGCAGACGGTGCAG 
TCGCCGGTGCTCGCCGCTGCGTTAGCAGAGGCGCAGCGCCGCAAAGTGGCGGTTATTGCCGAGAACATTCGTAAGGAAGT 
TATCTTCCCCTCTCTGTATCGCCCTGGCCAGCCGGATTCCAACGTAGCTCTGTTACGTCGGGCGGAGGAGCAGCTACGGC 
ATGCCACCAGTCCGGCGGAAATCAATCAAGCGCTGAACGATATCGTCGACAACTACTCGGCACGAGGCTTCCTGCGTTTC 
GGCAAACCCTTGAGTTCGACTACCGTTGAGATGGCTAAGGCTTGGCGGAATAAGGAGTTCACATGATTGATACATGGCTG 
GCACAGTGGGGCTTGAGACTTCCCTCGAGCAACGATGCCACGTTGCGGCTGCAACCGGCAGAGGGACCGGAACTGGTTAT 
GGAGCGCCTCGAGGGCGGTTGGCTTTTCGTCGTCGAGTTGGGACTTGTGCCTTCAGGGTTACCGCTGGGTGTGATCTTGC 
AATTGTTACAAGTGAACTCTCCATTCTCATCCTTGGCACCGGTGAAACTTGCGGCGGACGATGCCGGTAGACTTGTGCTC 
TGGGCTGAGGCACGTGATGGCGTTGACGATGTGGATGCACTGAACCGCTTGCACGATAGGCTGCGGGAAGGACATTCACG 
ATTAGTGCCATTGCTAGAGCCCACGGGTGAGTTGGTTCCAGCTCAGATACAAACC^GCGCGTTAGTGTTCGTTTGAACGC 
AAGGGGAAGGAGCGATACTGCCAATAAAATCCGCAGTTGGTAATTTCTCTAATAGATGGGCCACCCGAAGGTGGCCTATC 
TGTTCATGCTGCAATGAGTTGTCGTGCCAACACCATTTCTACCGAGCCCCATCCTGGTTCAACGAGCCGAGGCCCTCGAG 
GTGGGCTTTTCTGTTTCTGGAGTCCGGATATCTGATCCACAGCGCGCCAGAACGAGCGGCTGGTTTCGTGCTTTCGCCAG 
TTATATCGGCACTGGCCCACATTCGTGTGCCTGCCAGTCTGTTTCCAT 
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The Big Island: 84830bp (SEQ ID NO: 2) 

CACCGGCGATGCATGGGATGCCGTAGGGGTCGCTGATCCAGCCGAGGCCGCCCTGGATGGTTTTCTGGTCGGCGCTGGCG 

GTCTGGTTGTTGTTGCTCTGGTTGTCATTCACCTCCTCGGCCGGCGCCGGGAAGGTGCGCACGGTCCCGTCGTTGAACAC 

G AACG TG AGGCTG CGG AT CTGC CC ACG C ACGCAGG AG AGTGTCC AGTCGCCCGAGGCGGTCCCGCTGGCT ACCGCGCCGG 

CGACGTCCGGCAGCTCGATGCCGTTGGCGGTGAGGTTGTCCGGGCCGATGAGGATTTTGAACGGGTAAGGATCATTGACC 

GTCCCGTCGACCGGCACACGACCGATCAGCGCAAACATGGCCACCGAGCCCATGAGCGTCGAGTTCTGCGGCAGCGTGTA 

GGTCTTACGGACGAGCTTGCGGTTTTCCAGGTCAGATCGCTGTTGGCCGACGGGGTGCAGCCCGTCATCGATCCGCTCCA 

GCGCGTTCTGTCCGCGATCGACCGCATTGCCGAAGGAGGTCGGGAAGCTGAATCCGCTCGGTTGGGTGGTGGAGCCGGCG 

GCCAGCGGCTGGCCATTGGCATCAACCGCGCGGGCGTCCTGGGGCTCGATCCAGACGATATCCGATGAAGACGAGCCCGC 

TCCCTGAAAGTGCTGGCCATCCTTTGGTCGCACGCCGAATCCTACGGGCAGGTCGGTGTCACCGGCCTCGGGAATGTGGG 

TTAGGTTCTCGAGCCGi iTCTG'l'ACC'i'GGTCCAGGATCGTCTGGCTACGGTTTTCCTGCTGTTGCTGCAGTTCCTGGGCG 

GTGTTGTTGAGCTTCTGCTCGATGTTCTGATCGATGTTGCGCAGGCGGCCCTGCAGGTTCTCATTGGCGGCTTTGAGCGA 

GTCATTCTCCTGGATCACCTTGCTGATCTGGTCCTTGAGCTGCCGGCTTTCCGCCACGATGGTGCGTAGTGTGTCGGCGG 

GCGTGTCGCCGTCGATGCCCAGGGTTGCCGCTTCTTCCGACGTTACCGTCGGGGTTGCGGCGCCTTGTGACTGCGCACTT 

TCTTTCTTGCCCATCATGCTCACGCCGATCAGGATGGCGCCGATCACGACGGGGACGACCAGCAATTTCAGGAGGGGATT 

ACCGGTCAGCGCCACGACGGCCTCCTTTGGGATCGATCTGGCTGATGGAGGAGGGCAGGAGCGCGTCGGCAAGGCCGCGG 
"~,ZCCGCGCGTC^eAGATAGACGGT^ 

GACGAAATTGCCCATCAGGTCCCTGGGATCCAGGGCCAGGTGCTGG^CGCTGGCGTTCTGCAGCTTCACCGCCGTGATGT 

AGTAGTCGTCCAGCCGCCAGGCGCCCAAGGCGGTAGCCGTGATGGGTAGGCTGGGGAGCAGGGTGGTCAGGTCGAGCTGT 
CGCTTGACGCGCACCTGACCGACGCCATCCACCGGTTCCACCGTGCGAAGCGGGGCATAGAGCATCTGCGCCGCATAGCG 
CGTCAGAACCACGGGGACGGGCGTTTCGCGCGGCACGGCCTTCGGTGCTTCTGCGTGCTCGGTCTGTTTCGCTGCTGCCG 
ATGGCTGGGCTTCCCGGGACTGGCCATAATGCGGATCCACTGGCTCGCCGGCGACGATCCTGACCGGCTCGCGCGGCTGT 
TGGTCGGCCGTTGCTTCGGTGGCGGCGATATCGATGAGCATCTGCTCGCCATTGGTCGCGTCCTGTAGGCGCAGGCGCGC 
TGGAGGAATCGGCTCGTTGGCGAGCAGGTAGAGTGCGCCGCCGGTACTCTGGACGCGCAGCTTGCCCTGCAGATCCCGAG 
GAACCCCAACTCGCACGTTTCTGTCGACGAAAACAATGCGTTCCTGGCCGACCGTCAATGGAATGGCCAACGGAATGCGC 
TCCCAGCGCAGAATCTCCACCGCGTGGGCCAGTGTGGGTAGGGCAAGCATTAGCAAGAGCGAGCCTGTCGACTTCCGGAT 
CATAGACCTCCCTCCCGCTTGGTGGGGGCGGCCGGCTCCTCAAGCTCGATACGTTGAGGCGTGTCGGAGTAGCAGTCCCA 
CTGCAGGCCGAAGGGATTGGTTTCCGGGTCGACGTCGGCGCGGATAACGTGCAACGGGTAGCGGGCCAGCGCCCGCTTGA 
TCTTCTCGCCGGCGTAATACTCCGTGCTGTCCATGTCCAAGTTGACGGTCCAGTCATTGATCGAGTGCTGGATCACGCGG 
CCATTGCTCTCGCCAATGCCTCGACCGGGGATTTCCGAGGTGGTGCGCTCGCGACCCCTGAGCTCGCCGGCGTTACGACG 
AAACTCAAAGTCTTTCTCCAGGAAGACTTTGCAGGAGGGAGTGAGGTAGGCAGCGTAGCGGAACAGGTTTCCCTTGTAGT 
CCACCTCGCCGTCCTTGGGCCAACGCTGCACCTGCTGGAAGATGTAGAGGCCGAACGCATAGACGCTCTCTGGCGGAATG 
TCCCACCACAACCGGGTGCTTCCTGAGCGCAAGTCGGGCGGAATGTGGATCCAGAGGTCCTTACGGGCTTCCCAGACGCA 
GTAGGCCAGCACAACGATGACCATGCACAGGAAGCCGGTGATGAACCGGAAGGTGTTGATGTGTGCCTGCTGTTGCGCAG 
TGTGTTTTCTGAAACTCATGGGGACTCCGTGCGTCGACTGGTCCAGGCGCCGGAGCGCGTAACCAGGCGACGGTTGCCGA 
AGACGGGAAAGCGCAGCGAGAGGGCCATCTCCACCTGGCGGTAGAACCAGGTATCGGGCCGCCCCCGCTTCATCCGCCGC 
AGGACGCGGCTGCCGATACCTAGGCCGAGGGCGCCGACCAGCAGCGCGCCCAGTGGAATGCAGGCAGCGTTACCTGCGAC 
CAAGGCAGCCGGGATGCCAAGAACGAACCCGGCTGCTCCGCTGGTGAAGACCGTGATCCACATTTCGTCTGCGGTCAGGC 
CGCCGATGACTACCGGTTGCCGGTTCAAACGGGTCGGCAGGAAGCTGAGGGTTCCATCCTGAAACAGATGTTCTTCGGGC 
ATGGGAAGGCCCTTACATGATGGCGGTGGCTTTGGTGACGAGATAAATGATCAAGATCAGCAGGCCGACACCTACGGCTA 
CGCCCGCTCCGAGATCCGACCACTTCTTCTTCCCGTCATGGATGGCGTGATAGGTGCCGTAGGTATGCCAAGCGACCCCC 
AGAAAGACAGCCGCGCAGATGAGCAGCGCGAGGAGCATCGCTCCGTCATAGCCGAAGTTCTGGATGGTTTGCATGATGCC 
CGATCCCTCCCCACGGCTAGGTGCCTCGGGTTTGGGGAGTGCAGCCAAGGCGATACCGGGGAGCGCCAAAGTGATGGCGG 
CCAGGCTCTGGCAGAGGGCGGACAGTTTCTGGAGGGTGAGTTTCAGCATGAGTGTCTCCGTATGGGTCAGCTGAGGAGGA 
AGAAAGAGGTGATGCCGAGGAGGACCAAGATCCGGGATCGCGGAAGCGCCACCGGTGGCGCTGTCGAAGGTTGTTGGTGG 
CCCAGCCGCGCCAGGTGCTGTACATCGCCCAGGCGGACCACAGCAGAAGGAAGGTCATCGCTGCTCCAATGAACAGTCCC 
T CGC CGGCCG ATGGGGG AAAGC C AGCGGCGGC CTGG AACGCTGATGT CTGGGCTCCGCTCATGCTCATGGCAT CGGCCCT 
CCGCGCAGGGTGTAATTCCCTGATAAGTCGGAAGGGTCACGCGGTTGGGCGCGGCTGGGCGTCATGTAGTCCTGCAGGCC 
TTGGCGGATGCGCTGCAGGTCAGCGGCCAGGCGCGGGTAGTCGAAGTAGAAGCGCTGTCCGGGCTCATCGGCGCCCTGGG 
CACTGCGGCGGGCGGTGTCCTCGAGGGCGTTGAGCTGCCGGATCATCACCTCCAGGTTCGCCTGCTCGGAGGCGCTCGCT 
GCGAAAGCTCCCTGTGCGACCAGCGCACTGCAGGCCAAGACGGCCAGGCAGCTTCGGCGAAGGGATGGATGTGGAGTGTT 
CACGGCGAATCTCGTGCAGATCAGGGTGCGATCAGCTTGCCGATCGCCCTGATTCCATTCCGCAAGAAATACTCGGCGCG 
GGTCCGGGGTATTTTTTTATGAGGTTGCGCGCTTGAAAAACAGCAGGTAGCTGACGGGCAGAGCGACGCCCCAAGCCCAG 
C CTTGGGCGTT TTG ATT AC CG ATCT CGGTT ACGCCCCGCATG AG TGG CTG AACGCTCACCAGCTCCC AGCCTTCCTGGCC 
CAGACGCACGACCTCGGCCCAGCGCAGAGGGTTGAGCAAGGCGTGGTCGTCGGGTTCCAAGGTGTCTTCTTTGTGTTTGA 
ACATCGCGACGCCCTTGGTGAGGGTTTTGTAGGGAATGACGATGTTCTCGACATGGTATTCGAAGGGCTGTTTTATGCTC 
ATGGTGTGCTCCTGGATCGGTTCGAGTAGAACCGCACGGTTTCAAGAGCAGGAGGAGGGCCTCAACCGAAAATACTAGGC 
CTAGCACCAGTGAATTTCTGGTGGTTGGCGGGTTTAGAGACACGTATGGACGTCTGGGTGTATATTTCATATATATTTCT 
AAGGGAGACGGCTGATGCTTAGAAACATCTCTATTGGAGTTTTGCTAGCCATGGCTGCTATGTTGGGCAGTTATGGGGTG 
GCTGCCGCTACATTACGATGCGGGTCGGCAATTGTTAGTGAGGGCGACTTGATTGATGATGTGCTTAGAAAGTGCGGCAA 
CCCTGATAGCCGTAAAATTGAAGGGCCCGCAGTGGATGGTAGTGGCTATATAGTGCGGGGGGCTGCTACTGTCGAAAACT 
GGGTATATGGACCAAGGAATGGATGGTACCAGAAGCTTAGGTTTGTCGATGGAAGACTAGTTCAGATAAAAGGCAGTATG 
- GACTAGGGTATAGCCGTGGATGGTGTGTTTTCATCCACGGCTATAAGTCTCATCCGGCAGATGATATAAGGGTAAGGATA 
TTTGCGATTGGTAGGCCTTGTGCGTCGGAAATAAACACGGTTGTACTGGCGCCTGCACGGGAAAACTATCTGTTGTAGGT 
TGTTCGGATTAGACATGCCACCGTTGTAATTGGCTTGGAATTGCTTGCTGGAACTTGCTATGTCTAGAAGTTCAAGTAGA 
GTGCTTTGGTTGGCGGATTGTGCGGAGTTCATGCTGATTGATGTGTTTGATCCTGTCGCTCGGAGCGCCTGCTGGATGTT 
GTTTGAAGTGAGTTGGGCACCATTAAAGTTGGCGCTGAATTTTTTAGTGTTGCCCGCGATGTTGATGGCCTGTAAAAGGG 
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; 

TGGATATATTCGCCGCCTGAGCGGTGTTGACGCTGATATTGGTACCCGAGGCCGCAGCGCTGATCGCTTGCTGAAGGTTG 

TCTGCTGTTAGTTGTGCACCATTGAAGTCGGCGCTAAACTTTTTCGTGTCTTTGGCAGAGTTGATGAGGGCTAGTAGGGT 

GCTTATATTAACCGCCTGTGCGGTATTGACGCTAATGGATGTTCGGGTTCCTGCGGCGTCTAAAGCTTGTTGAATGTTGT 

TTGAAGTAAGTTGAGCGCCATTGAACTCTGCGCTGAATGTTTTTGTGTCACCTGCTGCATGAATAGTTTGAAGAAGGGCG 

GTTATATTTGCCGCTTGTGCGGTATTGACGCTTATGCTGGTGTTTGTTCCTTGCCGCATTTACTGCTCTAAGTAGGTTGTC 

GCTGCTGAGTTGGGCTCCATTGAATTCTGCGCTAAAATTCTTTGAGTTGCCTGCTGTAGATATTGCGGAAAGAAGTGTGT 

TTATATTAATGGCTTGTGCGGCTATTGTGGATACTCTGGTTAGTGATCCTGCTGAGTTTATGACTTGCTGAAGGTTG 

GCAGTCAGCTCTGCGCCGTTGAAAGTTGTTGAGAGGTTTTTTGTGTTTTCTGCAGTGGTTGTTGCCTTGAGAAGAGTTTC 
TATATTTAATGCTTGCGCGTTGCTTATGCTGATGCTTG^^ 

GATTTAGAAGGCGTCCGTCGAAATCAAGGATAAGTTTCATCTCAATGCTCCGCTGTGACCTAACGTTA^GTCTTGGTGT 
GAGTACTTCCTAGGCTGAATGAGTTGGCTACAAATCTCTTTGCCTTATGTCAATATCAGGCA^CGCTGTGA 

ATTTTTTGAGTCATTTTTTACAGCTAGCTATCTGCTCCTTCAGTTTTCTGTCGTCCACGTACCGCTGCATGATCGACAGC 
ACCAGGCTGTTGATGGCAAGGATCTTCATCGCGCGGGTCACTGCGACGTAGA 



TGTGiGCGGiG^iCAGGGTGATTGTTGCGTCCAGCTCATCGTCAAGGGTCAGTGAGCGAAGCTCAAGGATCCGCGCA 
AGATCAGGGTAGGTCGATATGATCTTGATCGAGCGAAGCATCTCACCGTCCTGGCTGATCTCGGCGATCTCCACGTACTG 
GGTGTAGTCGCGGTAGTCACGGAGCAGTTTCTTGTTCTGGACGTTTTGGCGCAGGCCTCGGCTG^TGCGTACAGATCCT 

ccaggtcgcgcagcgagtaactgtcgataccgcctacccagtggaatttgggctccggatgattgcggaccagctgcaS 
gcattctcgatgacgccgataacggtgcggtgaatgaaagtgcggtgaggaaggtccggcgggagggactttttc^ 
■ cgtctgcggacccagtccttgaagtttccgtgtttcgcccttgtaggaga«3atgatgttggccacgtgtgc^ 

— GGCCGAATCGCCAGCTCTGGGTCAGGTAGTGCT^^^ 

CTGAACCGGTAGAGCTGCTGATGGGGATCGCCGACGA^ 

TGGGTTGATGTCCTGCCCCTCGTCCAGGAGCATGCAGTCGAAGCGCTGGCTCAAATCGGGCTTGCTCAGTTGAT^AGCT 
TCAGGTAGCCGTCAAGGGGCATCAGCATGCCGGTGTCCTGGAGATCGACCATGCGCCTCCAGACTACTCGCGCCATGTCC 
AGGCCCTGCTTGAGGAAGCGTTCCTGAGCACTGGTGAGGAACGCCTTGTCGCGGAAGCGCGGGAAGTGCGGTCGGCCGAG 
TTCCGCGTCGGCGCTGGCCATGTAGTTGTTCAGCGTGGCCAGCACGTCACGTACCAACTCCCAGTCTTGGGTArcGAGTC 
CGCGGGCGATATCGGTCAGTCGCAGGTTCTTCGTCTTCTTGTGGGCGTACTGAATGCCGTACACCGCATGAGCCA^CTG 
JGGGCGGTCTTGCACACTACGTTGCGGGGAAACTTGCCCTTCGCGGCTTTCTCCACCG 

GATTCTCAGGGTAGGGTTCGACCTGGCAAAGCCCACCAGGGTGGTAGTTTTGCCAGTGCCTGCGMGGCTCGCACCA^^ 

tcttcggtgccttcgactggatgatcggtgactgttcgtgagtccac^^ 

GGTGGATGCTGTGATGCATACCGCTACGCCGAGCAGCGCGGCGCAGGGCAACAGGATGAGCAGGGGATTGATGCTGATGG 

ggattgccaggtagagcgtccaagggacgaccgctagcggaatga^^ 
tcccggccggcgccgaacttgcgcaggtcccggcgcaccaggccgt^^^ 

cgggatcgtcatgaccaggatcaccac^cgcacgacgaatgtcagcaccgtgtacagcgcggccagIccgS 

gcgtggagacaccgtg^cggtgagatagcgaacgtccagcgggctccgtggccgggcctgcgcgatggtactcatc 
gtcatccagtcgaccatcccggtcttcacgaacaaccagtca™^^ 

CTCCTGCACGACGACGCTGTGTAGCAGCCCCTGCGACAGCCAACTGAGTTCGTACTCAAACATGGCCTGGGCATGCTTCC 

agcccgcgtcaggccagaagaaatacaggcatacccactccacga™^ 

ATTCAGGAGCAGGAGTAGGGGGGCAATCGAAAATGCCAAGACTGGTGCCGTGGGTTTTCTGGGGTAGCCGCGGGAGAATC 
AACAAAGATGTTCAAACGTTCCTATAGGAAACTGTGTTGTG^^ 

GCTTGGAGTGTGTATCCACTAGTTCTCGGGGAGGCCGTCACTGTAGATTTTTTGGGGGGGTATCTCGGCATAGGGGGGAT 

TGAATAGCGCTGTGCCTGCAGTCCTGAGACCTCATTAGGAATCATCT^^ 

!^™AAGTTGAAG^^ 

TGCTCGGTGGCGCCTTGATGTTGTGCGTGCTATGCAGCCTGATATTCAGTGTGAGCATGGTTTTAAACCATCAGGTGTCC 
CTCAGTCGGCAAGCTATGAATGTGGCTATGTACGAAGCGCAGCTTTATTTCGAGCAGCGCGAGGCGTTGCTCAATCACTT 

GAGCGGCAATGTCGTGCCCTTGGCCGCGGGTAGAGCGCTCGTCAACGAAGCGCCGAACAATGTGAGCATCCTGCCG^ 

GTGACGGAG^CGAGGTCTGCTATTGACCGCTCGCACGCTCGGTGA^^ 

GTCGATACCGACAAAGGCCCTCTGGTTTACCGGCT^^ 

CAAAGAGGTGTACCGAGCCTTGCTGGCGACTCCGTCGGCGCCTGTTCACTGGGTGACTGACGK3TGGTACCCCTCAACGGC 
TGTACCTTTTTGAATCCTTAGGCGATGAGCCGGGCGAGGGGTGGCTAGGCCTGGAGATTCTCGGCGAAGACCTCGATTCG 
ATGTTGCGCCGGAATGATGCCGGAAACTACATGCTGCTGGATCAGCATGGGCAGGTCGTACTCGC^S 
GCTGGGGAGCGGTGCGTCGCGGACGCTTTTGCGTGGAGACGGCTTCGG^ 

TGGTGCTTTTCCAGCACGTGGGGTCTTCGAGCTGGGATCTGATCTATCACATCGGTATCGGTCGCCTGTTGCTGGCTCTG 
TGGCTCCCTCTGTTACTTGCCTCTGCGTTGGCACTCGCAGTCGGCATCCTACTGCATTGGCTGGTGCGGAGCATCGAGCG 
ACGCTTGATAGAGCCCGCAAAGCGACGCCTTGAAGCATTGAAGGAGAGCG^ 

CGCCCGTCGCGCTGTGCGTGCTGCGTCGTGCCGACGCCGCAGTGGTCCTGGAAAATCCCCAGGCGCGCCAATGGCTGGGT 
GATAGCGAGGCGATTGCCCACGACGCGCCGAGATGGATTTCCCA(^^ 

ACTGGAAACCGAGGCAGGGCTACATCTTCATCTCAATTACACGCCCACCCGCTATAACGGTGAAGACGTA^GTTCTGCG 
CCTTCAGTGAAATCAGTGCACGCAAGCGGATGGAGGCGGAACTGGCTCGCGCAAAATCCCTGGCGGATGCTGCCAATGAA 
GCCAAGACGCTGTTTCTCGCCACCATGAGCCATGAAATCCGCACACCTCTGTACGGCATGCTTGGCACGCTTGAGCTGCT 
TGGGCGTACCGAGCTGAGTCGGCAGCAGGCCGGTTACCTAAAGGCAATCCAGCATTCCTCGTCGACCCTGCTGCAACTGA 
TCAGCGATGTGCTTGACGTATCCAAGATAGAGGCCGGCCAACTGGACCTAGAGTGCGTGGAATTCTCCCCGCTGGAATTG 
ACCGAAGAGGTCGTGCAGTCGTTCACCGGTGCCGCGCAGGCCAAGGGGCTGCAGTTGTATACCTGCCTCTCTGCGGAGCT 
GCCGCTGCGCATGCGGGGGGCCGCGGCGTCGATCCGGCAGATTCTCAACAACCT^GAGCAACGCGGTGAAGTTCAC^ 
.ACAATGGCTATGTCAACGTCCACCTGAAGGCCAGCGTGGTCGATGCCGAATGTGTGATGCTGACCTGGCAGGTCAACGAT 
ACCGGCATGGGGATCAACGTCGAGGATCAGCCGCGTCTGTTCGAACCGTTCTACCAGATACGCCGCTCCGAGCATCCGGT 

cgcaggcacgggcctcggcttgtcgatcagccagcgcctggcgcagctaatgaatggcagtctgaSctggtcagtgagc 

TGGGGTTGGGCAGCAGCTTTAGCCTCAGGCTTCCGCTTGAGCGGATCGCGATGCAGGCTGAGCCGCAGGACCTAGCCGGG 

tgcgccgtccaagtgctggcgcctgtccgcgacctaacggaatgcctgtgtggct^atctcccgctgg^tggaag^ 

CATGGTCGCGACGCCGAGGTCGCTGGACGAGGCGGACGCGACCTCGCTGCTGGTCAAAGTGTTACTGCTGGAGGGGGCGC 
CGATGTTCGAAGCATGGCCAGGATGCCGGGTGGAGCTTTCCCCTCA^TGATATGGAGCCGCAGGCAC 
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TGGCTGCTCGGGCTCAACAACCTGAACGGCCTGCATCGTGCTCTGGGCCTGGCCCATGGGCGTCTCGCTGATCCTTCGAC 
GCCGCCGATACGGCTGGCTCCGTTGCGCAATCTAGGTCTCCGCGTCCTAGTGGTGGAGGATAACGCGATCAACCAGTTGA 
TCTTGAGGGACCAGATGGAAGCGCTGGGCTGCAGCGTGGAGCTGCTCTTCGATGGTCGCGAGGCGTTGCTGCACTGCCAG 
ACGGCCTGCTTCGACGTGGTGCTCACCGATATCAACATGCCGAACATGAACGGATACGAGCTAACCGCGGAGCTACGGCG 
CCAAGGGTTCCGGCAGCCGATCATCGGCGCGACGGTGAACGCCATGCGTGAGGAGCGCGAGCGCTGCATGTCCGCCGGGA 
TGAACGATTGCCTGGTCAAACCGGTGGATCTGAATGCCCTTCAGAACTGCTTGATTAATATTCTCAAGGTGGATCGATGA 
GCTGGAAATCCTATCGGGTGCTGGTGGTCGAAGATCAGCCGTTTCAGCGCGAATACCTGCTCAACCTGTTTCGCGAGCGC 
GGCGTGCAGTACCTGGTAGGTGCCGGCGACGGCGCGGAGGCGTTGCGCTGCCTGAAGCAGGACAGGTTCGACCTGATCCT 
CAGCGATCTGATGATGCCGGGCATGGATGGTATCCAAATGATCCTGCAACTGCCGTATCTCAAGCATCGTCCGAAGCTGG 
CGCTGATGAGCTCCTCGTCGCAGCGGATGATGCTCAGTGCCAGCCGGGTCGCCCAGAGTCTCGGCTTGTCGGTAATCGAC 
CTGTTGCCCAAGCCGACTCTGCCCAAGGCCATCGGCCAACTTCTGGAACACCTGGAAAGATGCCTCAGGCAGAAGCTGGA 
GCCGGAAACCGACGAGACTCCGCATGGGCGCACGGCGTTGCTGGATGCCCTGCATAACGAGCAACTGGTGACCTGGTTCC 
AGGCTAAGAAATCCCTCCACACCGGGCGCATAGTCGGCGCCGAGGCGTTGATACGCTGGAGCCACCCGCAGCATGGCCTG 
TTGCTGCCCAGCTGTTTCATGAGTGATGTCGACGCTACCGGTCTGCACGAGGCGTTGCTCTGGCGCGTGCTCGAACAGAC 
CCTGAACGCCCAGGAATCGTGGCGCAGGGCGGGTTACGAGATTCCGGTTTCGGTGAATCTGCCGCCGCACCTGCTCGATA 
ACCAGGAACTTCCGGATCGACTCTATGAGTACGTCGGCGCTCGCGGGGCTTGTACCAGCTCACTATGTTTCGAGTTGACC 
GAGAGCAGTGTCACAACTCTGTCAAGTAACTACTATGCAGGTGCCTGTCGCTTGCGCATGAAAGGGTTCGGATTGGCCCA 
GGACGACTTTGGCCAGGGTTACAGCTCGTTCTATAACCTGGTCACGACGCCTTTCACGGAGCTGAAGATCGACCGCTCCC 
TAGTCCAGGGATGCGTAGAGGATAACGGCCTCAATGCAGCTGTCATCAGTTGTATTGAGTTGGGTCACCGCCTGAATCTC 
GACGTGGTGGCCGAAGGCGTGGAGACCTGCGAGGAACTGAATCTTCTTCGTCGTCTTGGCTGCGACCGGGCGCAGGGTTT 
/CCTGATTTCTAAGGCAGTGTCTGCTCGTGAGTTCGAGCGGCAGTTAAGGGAGGACGGCCCCAGCCTCCTTGTTTAACGCA 
--GTATCCCCATTATGGCGGAGTCGATCGCAGCCAACCACCGT CAGCGCAACAGTGTCGCTGAAGGAGGCCGCATCCCGTGA 
AGTCTGCTAGTGCCTTGGAGCACGACAACAAACTTTTGCTCAAATGGACAACCCTCTCGCAGAGCCTGAGCATCGGCTTG 
ATCTGTGTGGTGGTGCTGACCGTATTGCTGTTCAGCATCTGTTACTGGTCGCTGGGGAGATTGTTTCAGGAGGAGGAGGA 
CAAAGTCTCCTTCCACTTCACCCGTATGATGGATGTTATACGGGAGCATGAGGTATTTCTTGGGCGCATCGCTCGAAAAA 
GCGACAAGACCACCCAGAAGTACGACTATGACGTGGTGCCTTTGCAGCGGCACTTGTTGGCAAAGGAAAACGGATTAGCG 
GTCTATGAGGGACGGGAGTTTTCCTTTGCTATGCCATTTCTACTGGCTACCAAGCACGCGTTGAGCGCCGATTCCTCGGG 
AGATCCGTTTTCGCTCGGTGTATTGCTCGCCAATTTCTACGGAAGCTTCTGGAGTGTTTCCGCCTATCCCGCGCCACAGT 
TACTGATCTTTGATCTTTCCGGCAGCACCCGCCTGGCAGTGCCGTCGATTCCCTCCACAGCGCAGCGTGACAGGTTGAGC 
GGAAGCTATCCGATGATAGTCGAGCGCATTCTGGCGCGCTTGCGCACCCGGCCGGTGGGGGAGGACGCTCAGCGTGTCCA 
TTGGATACGCGCTGATCGCTATCGCGACTCGGCGCTGGAGATGTTGGGAGTCGCCCGGGTTGATCTGCCGGAAACACTCT 
GGTGGCACGACGAGCCGAACCATCTGATCATCGCTGCGAGCCTGCTTGATCTCAGGCGAATCAATGACTTCGAACAGTTG 
GTTGAGCGCCCGGCATTCGATTCGTACAGCCTGGTATCGCCGGATGGCGAGGTATTGCTCGGCGCGGCCCCTGCGACCGG 
CCTGAGGGATGGCCTGAACCTCACCCGACAGGGGGTCGCCGTTCAACTGCTCAGCCAGCCTGAGAACGGTTGGCTCGCGG 
TCTACCGAACCGACTACGGCAATTTCTTTCGCCACTCCCGGTGGCTGGTGGCAGGTCTGCTGCTGACCCCGGCGCTGCTC 
CTGGCCGGTTGGCTCGGGATGCGTTGGTACACCAGCAGCGTCGTCAACCCGGTGCATCGGGCGCACCGGCAACTGGTGGA 
GAGCGACACCTTCAGCCGGACGCTGATACAGACCGCGCCGGTGGCTCTGGTGGTGCTGACCCAGGATGACCAGCAACTGG 
TGACCTGCAACCACTTGGCCGCCCAGTGGCTGGGCGGGCCCACGGAGATCCTTGGGCTGACTTCCAACTGGAAGCTTTTC 
GATGCGCGTGGGCAGGTACCAGGAGACATCTGTATCCAGGTCGGTGGGCGCTATTTGCAGACCGCCTTCGCGGCGACCCG 
CTATGCCGGCACCGAGGCGGTACTGTGCGTATTCAACGACATCACGGTCCACTGCGAGGCGGAGACCGCGCTGTCCAATG 
CGAAGCGAGCAGCGGATGCCGCCAGCCAGGCCAAGACCCTGTTCCTGGCCCGCATGAGCCATGAAATCCGTACTCCCCTG 
TACGGTGTCCTTGGCACCCTGGAGTTGCTCGACCTGACCACCCTGAACGAGCGGCAACGCGCCTACCTACGCACCATCCA 
GAGTTCGTCTGCGACGCTCATGCAACTGATTAGCGATGTGCTGGATGTCTCGAAGATCGAAGCGGGGCAGATGGCTCTGA 
CCCTGGCCGCCTTCAATCCGCTGGACCTAGTGCGGGAAGTGCTTGGCAACTTTGCCGCCAGCGCCATGGCCAAGGACCTG 
CAGTTCTATGCCTGCATCGACACCGAAGTGCCGGCGCAACTGATCGGTGACGTGACGCGGATTCGCCAGGTGCTCAATAA 
CTTGGTGAATAACGCGCTGAAGTTCACCGATATCGGACGGGTGGTCCTGCGCGTGAAGTTGCTCTCCCGCAATGATGGTC 
GAGCCCTGTTGCAGTGGCAGGTCGCCGACACCGGTATCGGTATCGCACACGAACAGCAGGAGCGCTTGTTCGAGGCGTTC 
TACCAGGTTTCGGGAGCGCACCATGCCGGCGGCACGGGGCTAGGACTGTCGATCTGCTGGCATCTGGCGGAAATGATGGG 
CGGTCACCTGCGAATGGTCAGCGAGACAGGGCTCGGCAGCAGCTTCAGCCTGGTGCTCGAGTTGCCCGAGGACGAACAGT 
CCGGGCTGGCTTGCCGGCCGGGGCTCTTGAAATCCGCTTGCGTCCATGTGCGCTCGCCCGTGCGGGAGCTAGCCGACAGC 
GTAGGGGCGTGGCTGAAAGCCTGGGGCTGCAAGGTCAGCAGCGGCGAGGCGGCGCCCTCCGAGCTGGAGACTTGTGTGCT 
TCTGGAGCTGCTGCCGATGGCGGCCGGGCCTGCTTCTTCGCCCTGGCCAGGCCCCCGGGTGCGCGCGTCCATGGATGCGC 
CTTGCCAGCCGGAGCTGCGTGAGGACGGCTGGCGTGTCGGCCTGCACAACCTGGCGGGAATCGGCCAGGCCCTGGCGCAG 
GCTCTGGGTGGCGATATCCCCGAGCAAACGCCGGCAAATGCCTGCGCCCGCTCGGGGAGACTCGACCTGGAAGTGCTGGT 
CGCCGAGGACAACCCAGTCAACCAGGCGCTGCTTCGCGAGCAACTGGAAGAGCTGGGTTGTCGCGTGAGCCTTGCCGGCG 
ATGGGCGGCAGGCCCTGCAGCTGTTCGACAGTGGTCGCTTCGACCTCCTGCTCAGCGACGTCAACATGCCGAACATGACC 
GGCTACGAACTGACCCAGGCGCTGCGCGAACGAGGCGAGACGCTGCCGATCATCGGCGTGACCGCCAACGCCCTGCGAGA 
AGAGGGCGAGCGCTGCCGGGCAGTGGGAATGAACAGTTGGCTGGTGAAGCCGATCACTCTGCATACCTTGCATGAACTGC 
TCAGTGAGTTCGCTCGCGCAGGTGTCGTGCTTCCCGCGCAAGCGCGAGACCTCGGCCCGCCCGCGCAGCTCGACGACGGT 
CTCTCACCGCAGGTGCCGGAACGCATGCGCGCGCTTTTCCTTGAGACCATGGGCAAGGACCTGGAGGCCGCCCGGCAAGC 
GATTCGCCGCAACGACCCGAAGGGGCTGCAGCAGGACCTGCATCGCATGGCCGGCTCCCTGGCGGTGATGCGTGCGCGAA 
CGCTGGTGGTGATGTGTCAGGGCGCCGAGGAAGGCCTGCTGGAGTCGCGCCTTGAATGTTCCGCCGTGGAGATTGGCGAG 
GTGCTCGTTCATATCGAGCAGGCGCTGGAGTTTGTGAGAAAGACGGGCTGATGCGTCCGGGGTCAATAGTTGGAATTAGA 
ACACAAGAGAAGCCTATGAGTAAGCTCAAGATAGTACTGGCCGATGACCATCCGATCGTGCGTATGGGCGTATGCGACAT 
GCTCGAGCGCGACGGTCGGTTCGAGGTGGTGGGCGAGGCCTCCACGCCCAGCGAACTGGTCGAGGTGTGCCGGCAGAGCG 
AGCCGCATATCGCCATTACCGACTACAGCATGCCCGGGGACGAGCGCTACGGCGATGGACTGAAACTGATCGACTACCTG 
TTGCGCAACTTTCCTCGTACTAAGGTGCTCATCTTCACCATGGTCGGCAACCGCCTGATCCTCGACAGCCTCTACGATCA 
CGGGGTGTCCGGCGTGGTGCTGAAGAGCGGCGAACTCGACGAGCTGCTCTTGGCGCTCGACGTGGTGAAGCAGAACCGCG 
TCTACCGGGGCGCGAACATGCTCGACCCGACCAGTGTTCTGGCGAACCGCGACGAAGTGGAAAGCCGCTTCGCGCGCTTG 
TCGATGAAGGAGTTCGAAGTACTCCGTCACTTCGTTTCCGGCAGCAACGTCTGCGATATCGCACGGCTGCTGAAACGTAG 
CGTCAAGACCGTAAGCACGCAGAAGGTCTCGGCGATGCGCAAGCTGGAAGTGAACAGCGACCAGGCCTTGATGACCTTCT 
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GCGTGCATGC C AACTTGTT CC ATTG AC AAGTTTC C ACCG CGG AG CGTTCG AG A CGCGCCGGCGGATGTCTCAGTG AAGCA 
GGCTCCGTCCCTGCTGGAGATTGCCGCCGTCATCGATCCAGCTGAAGCGCACCTCGGCCTGGCCGCCGGGCAGCGCCTGG 
CGGAGCGCCAGGACGCGCCTGGAGAAGGGGGGCAACAGTTCGGCCGCCGCCAGGCGGGTGTGGCCGCCAAGCTCGGCGCC 
GGCGAGGCTGATGAAATAGGGAGTAGGGTTCTCCATCTGTAGTGTTTCGCCCCGCCGCTCGAACCTGAGCTGGTGTGGCG 
CTTCTTCCGCACCTCTGGCCAGAGGGCGAGGGCGGTAGATGACTTTCATCTGGGTGCGCAGTGTAACGGTCAGCCGTGAC 
T GT CCTTC GGC C AG C AGC C C GGTGGCTTGCGG CGGG ATTTCGT AGAGGTTC AAC CAGTAC AGCGATTCGCGGTCCCCTGG 
CAGCGCCTGGCCGGTCCGCAGCAGGCGCAGGCTGCGTTGCTGTCCGGGTTCGAGGCGGAACACCGGCGGTAGCGGCATGA 
TCGGCGCG CGGG C AG A CTG CGGTGTCG AGTT CGGGGCGC CATCGTCG ATCC AGGTCTGC ACTATG ACCGG AT AGGC ATTC 
TGGTTGACCAGCAGCAGCGACGTCTCGCGGTGCCCCTCGGCGATTATTGCCCGAGTGCGCTCGGCGGTGACTCCGGCTCG 
GGCGTCCAGCACGGCCAGCATTGCTAGGCCGAGCAGTGCCCCTTGCAGCAGGATCTTACTGGACACGAATCACTACCTGC 
GCGCTGGCCTCCACCCGTCCCGGGGTAACGCTGGGCATGCTGCCAACGGTGAGTTTTTCCAGGCGCGCACGGAAGGTCTC 
GCTGTACTGGCTGATGCCTCCCGCTTCGCCAGTCTTGTTCGAGGCGTTGCCGATCACTGGATACCAGCCCCGTGTTTCGG 
CATTGCTGCCCATCGCGGAATCCTCGCTTACCAGCAGGTTCATCGCACTGCCGTCGCGCAGCAAGCGGATACCCACGCCT 
TGGGCCATACCAGGCTGGCCGTAGCGGTCGGAAACTAGGTAGGAGAGCCCGCCATTGGCGTTCCTCAGGCCCAGTGCAGC 
GGACGCGGCCTGCGCCCCGGCTGAAGCCCTGATACCCAGTGCAGTACCGCTGGAGGCGGTGCTGTTGATCACTCCCGTTT 
GGCACTTGAAGGCCACCTCGAAGGGCATCTCACGGTTCGCGCCGGCATTAATCTCACTGAGGGAAATCGACGGGAACACA 
ACGTGGGGCGTGACGTTGGTTACGGAACAGGTGGGATAGCGCTTGAGCGTCACGTCGTTGTACAGGCCGATGGCGCCCGG 
CCAGTTGTAGTGCCAGCCGAGGTAGTTGGCGTTATGGCTGGCGCCGACGTTGGGGTAGGCCAGGCCGGGACCCTTGATGG 
CGATGTAGCCAGCGGGCTGGGTGTAGGCGTACAACCGCGAGGCGGTAGTCGGCGAGTAGTAGCGGATGAACTCCAGCGGA 
GCGCGGAACAGTTCGGCGCGTACCGCGCTGAGGTTCTTCGCCTTGACCAGTTGAAAGCCTCGCGAGTCGATATCGAGCCC 
jGCCGAGCAGACGCTCGCGCCAGACATCGGTGAAATACTGCCCGGTTTCCACGTGGGTTAGGCGCAGCAGCACGTTGGGCC 
AGGCGGTGCGATAGGCGGACTGCAGGCCAATCGAGTTGCCCGCACTGTCGCCTCCTAGGTACCAGCCGCTGTAGAGATCG 
TCGGCATTGGTGGAGAACATCTCGTACACATCGTCCTCTGGAGCACAGCGGAAGAACACGCGCTCTGGGTCGTAGCCGGC 
GCTGTCGCCGTATTGGCTGAGCGGCGCCACCGAGCTGGCGATCAAGCTACCGTCGGGCTGGAAGCTGGCGTCGCTGACGT 
TGATCACGCTCGGTAGACCGAGGGAACCGTTGCAGGTATCGCAGGCTCCGGCCCAGGTGCCGGCAGAGCCTTCACCGGGA 
CGTATCTGGGTGTTGGAGGTAGTCGTGGCGTTGCCTACCGCCGTGACCTTGTAGCAGGTCGCCCAGCTAGACGAGGCGAG 
CAGCGCGAGGAGCAGAGGCAGGACGCGCAGGGATGTTTTCATGGTGCCTCCGAGACGGGCGTGCAGGTGCCCTGCAGGCG 
GATCACCGCCTGTCCGGGTTCGATAGCCTGGCGAGGGCCAGTGTCGAGGTCGTAATCCAGCAGACAGGCGTCCCCTGGGC 
GTTCGCTCCATTGCACGCGCAGGCGGCCCTTGTCGCCCTCGGCGCGGGCATAGACCTGGCCGCCTTGACCGACCATGCCG 
ATGTTCACACCCTGGCTGTCGAGCACATTGGCCCCTAGCGGTAGCGCGCTGCCGTCGGCGAGTTGGGCCTGGATTAGCAA 
TGGGTGACCGGTCAGTGTGCGGAACTTCACATGCACGGCGGCGCCGGCGTATGGCGCGATCTTGCGCTCGGTCTCCAGCA 
GCTCGGCCTCTTCGCCCATGCCCTGCGGATCGAGGCTGACCGGGTTGTAGCGGTAGGGCGAGAGTGATGGCACCACGGCG 
TAGCCATTGCCGTTGACGCGCGCGCCCTGACCACCGCGTACGCCAGCTCCGCTGGCGCCCTTGGCCTCGACCAGGGCGAA 
AGTGTCGCCGACGTAGGGGCCGAGGGTCAGGCCGCCGCGATGCAGCACGGCGGCGCCGCGCAGGCCGGTGTTGAGCTGCC 
GGTAGCTGCTGTTCTCGGCATAGCCGGCGTTCAGCGTCGCCACCGAGGTCTGTTTCTGCAGACTGCCGTTGAAATCGCTG 
CCGTGGTTGTCGCTGTCGCGCCCGGCGGCAATCGCGTAGCTCAGGCTGCGGTCTTCGTCGAGGGTGCCGTTGAGGCCCGT 
CTGGTAGCTGCTGCCGCGGCTGTCGCCGGAACGCCGGGAGGCCATCGCGCTGAGATTCGGGGCCTGGCTGGAGGACCCCA 
GCGGCATGGAAAGTGTCAGGGTTAAGGTGTTGCTACGTTCGCTGCCGTGCCGCAGGTTGTATTGCGGCGGCAGTGACGGG 
TCGTAGTCGTCGTTCAGATCGCGGTACCAGGTGGTCTGCTGGCGCGAATAGGCGAGGTTGTAGCTGAGCTGGCGCCAAGT 
GTTGCTGTAGCCGAACTGCAACTGGGTGTCGCGGCTCTTGCCGTCGTAGTAGTCGCTGGTGGCTCCGGACAGATACAGGT 
TGCCGTAGCCCCCCAGGCCCTGGTTGACCAGCAGGGTGAACTGGTTGCGTTGCTTGTAGCTGCTGGAGTTCCAGGAGTCG 
TTGTGCTCATCGGCGTGGCGCGCCGAAAGCGCGTCGCCGAGGTCGCGATAGCCCTCGGTGGAATAGCGGTAGCCAGCCAG 
GGTGAGAGTGGTCTGGGTCGGCTGGAAGGTCCGGCTGTAGTTCAGACCGACACGCCAGCCCTGCTTGCGCTGGCCGTTCT 
C C AC C GT GGC ATGGG AAAAG ATGCTGTTG AAGCCGAAGGCTCCGTAGGGCG TGGCG AGC ACGCCTCCGCCG AGCAGCGCC 
AGATAGTCCTCGGCCAGGCGCGAGCCGAGGTTGGCGGTTAGCGAGTTGGTCAGGCCGCGCTGATAGGTGAAGTCGCCGAA 
CAGGTCGTTGCCGTCGCCATACTGGCGCGCTTGGCCGAGGGTGGCGCTGTAGCGCGAGATGCCCGGGCGCATGGATTCGG 
G AA CCGCGG AAAAGGGCACGG AAAAGCGCG AG ACGCTGC CGTCGGCCTC AAT C ACTTGG A CATCCAGATCGC CGTCGTAG 
GCGGTGCCATAGAGGTCGTTGATGACGAAGGGACCGGGAGCGACGGCGGATTCGTGGACCTTCTTGCCGTTCTGGCTGAT 
G ACC AC C CGTGCGTTGCTGTT C GC T GT AC CG CGTACCTGTGG AGC AT AG CGG CGTTG CGAGTCTGCCAGCATGCGGTCGT 
CGCTGGCCAGGCGCACGCCGCGATAACCCAGGCTGCCGAACAGATTGCCCTCGGTGAAGCTCTCGCCCAGGGTCAGTTCG 
CTGCGCAGGCCTGGCACCGCGCGCTGGGCATAGGTGCGGATGCTGTTCCAGTCGCTGCGGGTGTTTCCCGCATAGCTGGA 
GTAGCTGTAGTTGGACTGGTGGCGCAGGCGCCACAGGCCGAAATTGATGCCCCCGCTCAGCCCCAGGTAGCCATAGTCCG 
AGTCGCCGTTGCCGCCTACTCCGTCGAAGCTCGAGCGATAGAAGCTGGCGTCGTAGTTGACGAAGCCCATGCTACTGCCA 
GCGTCCCACTCGTCGGGACCCACGTAGCCGCGTGGCTTGATATCCAGCAGGGCCTGGGGCACCGAGAGATCGAGGCGCAG 
AAGGGCGGTGTCGAGATTGAAGGTCGAGCCTGGCAGGCGTTCTTCCAGCCCCAGGCAATCTCCAGGCGCTTGCACGCCGG 
CCTCAGAGGCGGGGCGGACGCCCAGCGTCCGGCGCAAGAAGCGTTCGCCGAAGCAGGGTTTGACGCCGCCGGCCCGGGGG 
CGGAACTCGATCCTGGTGCGACTGGCGTAGCTGCCGTTGAGATAGACATCGACATGGTAGGTTCCGGGGTCCACCTGGTT 
GGCCCGGTTGAAGCGTTCTAGGGTCCCGCCGGCGAGGCCCGAGCCCATCAACAGGCTGTCGTCGAAACGATACTGCGCCT 
CTC C ATTGGCC AGGCC CGCG AAGGCC AGCGCT AGGC CGC CC AG AGC C AGCGG C AGTTTGCCGGTGCGCCGTGCCTCAACG 
TGGCAGAACATGCTGGACCTCCATTCGCGCGCCGTAGTCATTGATCAAGAGGGCGTTCAACTGGACCGGTCCTGGGGCGA 
GCGGCGAAGGGCGTTCCGCCGCCCACTCGGCCTGGCCTTTGGGCGGGACCATGCTCGACAGCAGCCGCCAGCGACGTTCG 
CCGATGCTCAGCATCGCGCCGGCGAAGGAGGCGTAGTAGCCGCTGGGGTTGTCCACCCGCACACGCCAGTTGCCGCTCGC 
CTGCACCAGGCTGAAGTGCAACTGCTCGACTAGCTGCTCCGAGCCGCCAAGCAGCGCGGCGGGGCGATAGAACAGCTTCA 
GTCGACTGCGCTGCATCAGCAGCATCTGGTTGCGCTCGGCCTTGGCCAGGTCGCGGGGCGGGATCTGCAGCACATTGAGA 
TGGAACAACGATTCGCGATCCTCGGGCAGCGGGAGCCCGGTATACAGGAGGCGCAGGGTCTGGCCGCTGCCGGGGGCTAT 
GCGCGTCACCGCTGGGCTGACCAGAAAGGGCGCGTTTGCAGTCTCCGGGGTGGAGGAGGGGTCGTCGTTGTCGATCCAGG 
CTTGGATCACGTTGGGGAATGCGTCCTGATTGATCATCTGCACGGTCTTTTCCCGGGCGTCGCCGGGATAAATCACGCGA 
GTACCGATGATCACCACGCTGCTTTGAGCCGGGATGGCGGGTATCAGCAGGGCGAGCGCCAGCGTAGCCGCACGGAGCGG 
AGGCAGTGGAAAAGTATTCATCTGTTCACCTTGGCGCCGTGGCGCTCAGGAGGGGCCTCGGGTACTGCGGCGGAACCTGG 
GAGGGCGCCGCTGGGCGGCGCCTCCGCTATTACTCGTAACGCAGGGTGTAGGTCACCGAGCCGAGCACCGGTCCCGGAGT 
GACGGTGGTCGCTTCGGAGATGTAGCGGACGGCATAGTCGTAGCTGGTGCTGGTCTCGCCTCCTGCGAGCACGATGTCAC 
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CGGCAGCCACGGCCGACCCGCCGGCCAGATTGACCTCGCTGCCGCCCGGTGCCGTCAGTAGCTGCAGCGCTACGCCGGTG 
GCGCCGGCGG ACGCGG T AT TGGT C AG ATTGCCGGCGCTGGTCGG ATTG ACCGCCTGG AAC ATCACGCTG AAGTGCTCCTC 
GGCATCGGCCGCCGGAGCGGCGCAACCGGTCAGTTGCAGGGTGAAGCTGGTTTCCCCGGCGGGTTTGCCGACTGCGCCGT 
CAAGAGCGCTTACCGGTACGGTGTCGAGTATCACGGTCGGGTCGGTTCGTCCGTCGACGACGGCGGAACAGGTCTGGTCG 
GTCACTTCTCCGTGGAAGGTGATGGTATTGGCAGCCGCCGCGATACCCGGCAATGCGAGAAATGCGGCGATAGTCAGGGC 
AGTACTTTGAGGTTTCATTTCAGGGTTCTCCTTGTTGAGTTTTTTCCTCCGAGCAATGGTCTGTTAAGAGCTATCCGGGC 
GTATTTAGGA CGTTTCTT AAAGG A C CG AAAG TGGGTTTCT CGCTCTCGG C AAATGTTTTTACTGC AATGGACTACCCCTG 
CTCCAATAGACATTTCCGGGCCTCATAGCGAGGCGCTTTCAAAGGCCTCGGGACTGACGCCACCCAGATGACTGTGCCGC 
CGGGTTCGGGTGTAGAAGACCTCGATGTAGTCAAAAACATCCGCCCGGGCCATGTCTCGGGTTTTGTAGATGCGTTTGCG 
GATACGCTCTTTCTTCAAACTACTGAAGAAGGATTCCGCCACGGCATATCCCAACAATTGCCGCGCCGGCTCATGCTGGG 
CTCCAGGTTGTTGGCAGCACAGAAGCGCTTCCAGTCGTCACTACCGTGCGGACTCCTATAGGTACTTTTTGAAGGTGGAT 
GCTGTG ATGCA CA CCGCTA CGCCG AG C AGTGCGG CG CAGGGCAACAGG ATG AGCAGGGG ATTG ATGTTGATGGGAATTGC 
CAGGTAGAGCGTCCAAGGGACGACCGCTAGCGGAATGATGCTGCCGCGCGCCTTGTGGTAGAGGTAGCTGGACTCCCGGC 
CGGCGCCGAACTTGCGCAGGTCCCGGCGCACCAGGCCGTCCACCAGGCCGGTGAACGCGGCCATCACGAATAACGGGATC 
GTCATGACCAGGATCACCAGGCGCACGACGAATGTCAGCACCGTGTACAGCGCGGCCAGGCCGTAGTTCTGCAGCGTGGA 
GACACCCTGGGCGGTGAGATAGCGAACGTCCAGCGGGCTCCGTGGCCCGGCCTGCGCGATGGTAGTCATGTTGGTCATCC 
AGTCG AC C ATC C CGGTCTT CA CGAA C AA CCAGTC AT AGGC C AA CTGGGC C AGCCAGGTGGCGGTTCG ACCTGGCTC CTGC 
ACGACGACGCTGTGTAGCAGCCCCTGCGACAGCCAACTGAGTTCGTACTCAAACATGGCCTGGGCATGCTTCCAGCCCGC 
GTCAGGC C AGAAG AAATA C AGGCAAA CC C ACTCC ACG ATG ATTGCGCCG AT C AGCGACCCGAAC AT CACGCCG AG AAATT 
GGAAGGGCGTACTGATGATGGTGCCGAGAAGGGTCTTCTGGCTCTCTTGTTGCTGCTCTGCTCGTTGAGTGACTTCAGCC 
ATCAGGCGGCATCCTCACCGGTGCCGAGGTGACGGAAATCGTCGACCAGGTCCTGGGGCAGACCATCCTGCAGGGCGGAG 
TATCCCGGCGCCTCCCACCACTCGCTGTTGGCGGCCTGGCCCTTGCGCATACCGGCAGCCAGCTCCTGCAGGCTTTTCGG 
CATCACCTCGTCGGGGGCGACCGCCGGCAGCGGCATTCGGATCTTCCAGAGATTGCCGCCCTCGAGTAGCGCGAACGCTT 
GTCCTTTGGGCAGACCAATGATGTGGGCCGGCTCGAGCATCGGCACGCTGGTCATCTGCACCTGGTCGTGCGAGCTGGAG 
GTGAAGGCTACCTTCTTGTTGTTGTTGATCGCGTCGTTGGCGCCGCTCGCCGGCGTGCTGGTGTAGATCTGGACCTTGGG 
GAGCTGATTGGTAAGGAGTTCGGCCGTGGCGGTCTCGCGCACCCGCAGCATGAACAGGTTGTTGAAGTTGCCGATGATCT 
GACCGGCCTTCGCGCGGGAGCCGATCTTGGCCTCGATGTCGCTCATGGTCTGGGTGTAGGCCGTCACCTGCACGCCGGCG 
CCGCCCGCTTTGTTGACCATGGGGATGAACTCGTCGCCAATCAGCTCGTTGAACTCGTCGGCATGCAGGTTGATGCGGAC 
CTTGCCGCTGGCGAGCGAGCCGGGCAGGCCGTCATCGACACCATGCTTGTAGATGTGACCCGCTACCGAGACCAGGTCGC 
TGAACATGGAGTTGCCCACCGCGGCGGCGACCTCGGTATCCGATAGTGCGTCGAGGCCGACGTAGACCACGGCGCGTTTG 
CGG ATGAC CTGC AT C CAGTCG AAG ATCGG C CGCGG ATCGTTG AGGTCCGCGT AGTTGGGCGAAAG AAGCTC CG AGATCCG 
CCCGGTAGTGAGTTTCTCCAGTAGCGGCAGCAGCGAGGCCACGATCTTGTCGAAGTAGGTCTTGTCGTAGCGCACGGCGC 
TCTTCAAGCCTTCCATGACCGGGTCGGCGATGCGTTTCTGTGTCAGGTACTGGTCGATGGCTACGACCCGCAGGGGCCGT 
CCTTT C ATGTTG AACGGG AT GTT CTTGT C GTTG AG CTTGCCCT CG ATCTGG ATGATGGTGTCCCAGGCCCTGGGAT CGTG 
CTCGCTGATGTATTTCTGCGCATATTCGACGAACAACGCATCGATGTTCACGACGTGCCGGAGGATCTGCTGGTAGTCAG 
GCCGGATACCCAGCGCGTGCAGCGCGCGGGCGATGATGTTGACGAACCGCCAGGCGAACTCGCGGAACGCCGCGCTGTTG 
CCCTCGCCGGAGAGCTGGCCGGCGACGCGGGTGGCGACCTCGGAGATCCGACCGAACCGGCCGACGGCGTTGTAGCGTGC 
CGACAGGTCAGGATGACCGAGGTGGAACACGTAGAACTCGTCCAGGCGGCCGGCACGTTCGCATTCCACGTACATACGCT 
TCAGCAGGTCAGCGTCGCCTTTCGGGTCGAAGACGATCACCACCTCGTAGTCCGGCTGCTCCTCTGCGCGCCGGCGCCGA 
TAGCCGTGGTGAACCGTCTGGGTCCGCCGGCCCATCTTCACCCGCCGGCGTCGTACCCGGCAGTGAGTGCGGCGAATATC 
CTGGGTGATGAACAGCTCCGCGAGGCGCGTCTTACCCACCCGCGTGGTGCCGAGTACCAGTGTGTGGCCGACGCGCTCGC 
CCAGTTGCAGGCCTACGTCCTGTTCGCGCGGCTCGATGCCATGGAGCCGAGGCAAACCGCCCACGGGCGGCAGCGGCCGT 
GCGGGGTTCCAGGCCACGTCCCAGGCAGTGGCTTTGGCGACCAGCTTCAGGGGGAAGGGGGCGAACTCGAGCTGCTTCTC 
CAACCGGCGCGCGCGCTCGTAGAGGGGCGAGGGCTCGACGTAAGAGGCGAACTGGGGCAGGTAGGTATCTGCCAGGCGCT 
GCGTATGCTTCTGCGTCCAGCGAAATCCTTTACCGATGAACAGGTGTTCGTTGCTGACCGGCATCTCGGCGCTGGTCATC 
GTGTACTTCGGCAGTCGGCGAATGTTCCGCCGGTAGCGGAGCACCACGCCGGCCTGCTTCAGCCGCACGATACCCAGCCA 
GGCGAAGCACAGCGCGGCCACGATGCCGAACAGCGGAGTGAGGGAGAACGTCCACGGCGCGACGATGCAGAGCGCGGCTG 
CGGTGAAGCACACGGTGGTGGTGTAGAGCTCCACGGCAGGCCGCAAGAGCGCTTCCAACGGGTACTGGCCAGCCATGGCT 
TACACCTTCAGGCTTGCCGGCGCCTGCAGGCGGCCCTTGGCATGAAGTTCGAGCACCTGTTTCCTGGCTTGCTCTACCTC 
ATGATCAGGCAGCATCTGCACCAGCAGCTCGCAGAGGGTCGACTCGTCCTCCAAGGCCTTGGACGTCACCTCACCGCCGT 
AGCGTGAGATCAGTCCGAGCAACTCCTGGTACATGAGTTGGCGTGCCTGAGCGTCGCTCGGCTGCTTGATGACTAGGGTG 
ATCAGATGAGTAGTCATGAGGACTCCTACTGCTGGATGGCGGTGGATGTGATGAGGGCGGGGTAATGCTGCAGCCCTAGC 
CGGTCGACCAGGTCGTCCGCCGGCGCCGGCAATATCTGAAGTCCTTTCCCCCAGGCCCGGATTTCCGTCAGGCGCGCTTC 
GCTGGCCACGTTCACTGCCAGGCCCACGGCTTGGAGGTCTCGAAGCTCGTCGCCTCGCTCTTTCAGCCAGGTTCGAGACA 
GCGTGTCGTCGCCGACCAGGAACAGCAGTTGCAGGCCTGGAGCGTTGATGGCGCGCCCCTGGACCCGTCCTGGCGATAGG 
CGGGCGG AGCG AA CTGG AAA CGCA C CTG AGCC ACGG ACGC CC AGGTCGCCAGGTCCTGGTGCGGAAGCGG ATGGCTGCGG 
ATC C AGGCCCTGG T AGTAGGGG AG CGCCG AGGCGCCGC CG AGGT CTT CGACAACG ATCAGCCCGGCGGCCCAGGCTTG AG 
TCGCGAACGCCGTGCAAAATGCCAGCGTGGCAAGGAACTTCTTCATGGATTGGTCACCAGTAGGTTGACGCCGAGAACTT 
GGCTAAGGTGGCGGG AAA C CGT CCTA CGGTATTTGGCGG C AGGCGCGC CGCCGGCGGGGCGGTGGTAGCGGCCGGC AACT 
TGGATCCAGTCCACCGGTCTTCCCGGGGCGGATGCATACAGGGCGTCCCGCTGCTCGATCAGGATGTCGGAGGTGGCGTC 
CAGGTTCTTGTACGGATCCAGGGACTCGCAGGGGCTGGAGAAACGATGTCCGTTCCAGCCGATGTTGACCTGGCCGAGGC 
CGGAATCGACGCTCTTGGCCCCGTAGAGGTTGATCGCCGCGAGTAGCGCTGTGCAGGCGGCGGTGCGGGTCGCGTAGTAG 
TAAGATTTCCCGGCGACGTTCAATGTCCAGGGCCAGGGAACGTATTCGCCGCGCAGCAGGACCTTGCTCTCGGTCAGCGC 
GACCGAGTAGAGCACCTCGGCCGGAACCCCTTTGGGCAGGGCGATTTGCTTGTACGCCGGCGGTGGATCCACGGCGGCCT 
GAGCGATGTTGACCAGGACCAGCAGGGTGGCCAACTGGAGGGCTCGAACTACAGACGTTGCCATCGGCCGTTCACCTCCT 
GTACCAGGGCGGGAAGGGCTCCGCCCAGTCCTAGGGCCATCCAGCGGCCCTCGTCATGATTGAGCGTGATCTGCTTGCTG 
CGAACCTTCTTCGGGTCGATGCCGGCGAGGATTGCCCAGCGCCGCACTCGCTCTGCGTCGTTCTGGCTACCGACGAAGTA 
GAGGTCGAACTCCTTTTCTGCATGTTGCAGGTCGCGGACCCGCTGGATGCAGGCGGTGCAGTTGTCCTGGACGAACAGGG 
CCAGGCGC CCGCTGCT CTGCAACGCTGGG CTC ATGTTCGG CGTGCC CG ACGGGTTGGCTGTGCTGCTTTCGGTG AGGCGG 
ATCACCCCCTCGCCTGGATAGGCGCGGGCGAAGGCTTCGTCGTATGCGCGCTGGTAGGCGAGTTCCTTCTCGACCCGGCG 
CCGTTCGGCCTGGACCTGTAGATCGGCATACCGCCGCCGTTCCTCTGCCGATCGCGCCTCGATGCCCAGCGCGGTCAGCG 
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GATCAATACCAGGCGAGTAAGCGCCCCGGGGGCCTTGCATGAGCGTCTGGTAGCGGGTCCATTCCTGTTGGTTCAGTCCC 
CACTCCTGGGCCAGGTCGCTCTGAGATGTCTGTTGTTTCAGCGTGCCGAGTTGGCTACCGGACAGTTGGCTGCTGGCCTC 
TCGGCTTGCCGCCGCCTCCTGGCCGGCTGCGGGATAGCTCAGCAGCAGCATCAGGCCGGACAGGAGCGATACGGTTCTGA 
TCATGGGCGGGGTCCTATGGTTGGATGCGAACGCTGCGCGAGGTGCCGGCGACGTCGAAGTGCGCGGTACCATCGTCAAG 
GTCGGTCAGTCGCCAGGTCGTGCCGGCGACGGCATCTCCCCGGCGAATGAGGTAGATCTGGCTGAGCTGGGTGGATCCCG 
G AGG TGC AAC CG AC AG AAACCGTTC T CCCCCGCG AT ACTC C ACGCC AAGG ATCG AG AAGGGCGGGGGCTCCAT CGGTTTG 
GGTTTCGGCTTGGGCTTGGCGGGTATGGGGCGTTTTGGCGCTGGCACGATCAGCGGCGGCGCCTTGGCCTGCTTTTCTTG 
AAGCG TGCGG AC AG AACCGT CCAGTG T CTCC ACGG TTGCCTTG AGCAC C AAG AGGTCGCCGGCCG AGGCGGTGGT CTG AG 
CCAGGTTCTCGACGGCATCGGAGGCCTGCTTGGCGAACGCCTGCGCAGCGTCAATTCGGTTGGACAGCGCCTGCTGGCCT 
GAACGGAAGTCCTCGTTGCTGACCAGGTGCTGTCCATCCACGGCGTCGAGGCGCTCGTCGACTCGACTCAAGCGAGCCAG 
GATCGCCTCCAGCGAGGCCTTTTCCGCGGCACTGTCCACGCCTGATCGGAGCTGAACGAGTTGGTACTGCTGGTAGCTCA 
GCAGGCCGGCCATTGCCGTCAAACAGAGGCCAAGAATCATTGATGCAGGGGATGGGCGTTTCATGAGGGTCCTCGCGCGA 
AGCTGGGGACAACCTCTCAGACGGACAACGTTGATGCAGTAGGAAATACTCGGCGGAAGGGGGGAGGATTTTTTGAGGGG 
AGAAATTCTTGCGAATCCTCCGGCCTAATGCTCTGTGCGAAGCTGACGAAGCTCGGGGGCTCTTCGTCGCGCATCAGGGC 
TGAACGCTGGATAGCACAGCAGGCTGACCTGCACCGAAAAAACCAAGTGATTGGCATCGCCGCGTCGCTGGCGGATCGAC 
G ATGGGGCGG A CTCGTTG AGG TGG AGT AGT C AGTG AGG TCATTCGGCTT AC CGGCAGTG CAAATAGG AT ATCCCGCTCCG 
TGACCGAGAGGATCAAGCGGCGCGGCATGTCTCAGATGTCTGTGCGGGTGTGGTGGCAGAGAGTTTCCCGGCTGATATTC 
GGAGGTGGGGAACAATAGAACCAAAAACCAACGTAAGCAGAAATGAGTGCTGCACCCAACAACGCCGCCACAATGCCACT 
ATCGATTTGACGATACCGGTATACGTAAAAGCATGGAGAATCAGTATGTTACTCACCCCAGACGAAGGTTAGGTTGACCA 
TTGCGCTGCCTTCTGAAGTTATGGGGCATAGATTATGGCCGAAGCTATCAGAAAGGATGCAATGATGACAAAACTCTACT 
TTGATCTTCTGAACTCGCCTGCCGAGGCTCATTCGTCGATACAAAAGTCT.TTATCTGT.GCAGGCAATCTCCACAACTGTC 
CCAATACTGGAGTTTCCTTCGGAAACCGTATACGCCTATGCATCGTACATAAATGCATTAAGTATCGGTCAACGCATAGA 
TCCTGCATTCACCCAGAGCTTAACGAGTGCCATATCCAACCTGGCAGGTCGCCCGATTGCAGTAAGCGACATTTACCAAA 
AAATTCATGAAACCACACTGAGAACACCTGTTGAGATGGGCGTTCGTCCTAATAGCATCACCTTTGAGGAGTATCAGGCC 
ACCATAAATCAGCAAGCCATCAACATGGTTCAAGATATGCAGGATGGAGACAAAGGTGAGAAGGTGGAGGCCCTCCAGGC 
CAATATGCAGTTCCTGTATGGACAGGAGATAAATACTGATTTCATCGCTCGTAATGAACTCGCTGCTGGGCAGAGAGCGA 
AAACCGTCGCAATAGTTCAGGGGCATATCACCATCGGGTACGGCTTCGATACCTTCGTGCATGAAGCGTCCGAGCTAAAC 
TCTTTGAATCTTGTTGGTTCTACGCGACAGAAGGTATTACCTGCATTGCAGCTATCAACGTCCGACCCAGGCTTCTGGAG 
CGTCTATGCCTTGCTGGGACAAAGTCTCACGGATGACGATGGGCTATTACTCTTTAGTGCCAAAGCGCGAGCTGTTGTTC 
AACGCATAGCAAGCAACCAGTTTGCAGGTAAGTGGAATGGGCTACCCCCAGCTATCAAAACGGTTGCGCTTGATCTATAT 
TATCAATATGGGCAGACTGGTAATTTTCCAAAATTTCAACAAGCTATAAATAGCCATGATTGGCCGGCAGTCATCCATGA 
ACTTAGAAACTGGAATGGTGTACCGAATGATCCTCTCCAGTTCATTACAAAACGATTGGAAGAGCGAGCCAAGTATCTGG 
CAATATCCTTCAACTATGAGCAATGAGGATATGGATATGAACAACACAGTGAGCGAAACGCAACAGATCAATATTTACCA 
AAATCCGGGGCAGTCTATTTCCGGTCTCTACAAGGGGCTGGCTAACCAGTGCTCTCCTGGCCAGCCATTTCCAGAGGTAC 
AGCTTGTGGAGGCTTGGGATATCCCTCTCG.TACTCCATCCGGAGTTTGTGCCTAACGGAGATGTCTCGAAAATCGATAAG 
GAGTACGGAACGATCCTTGCTGCTGAGTCAGCTCAGGTTATCCTGCTTCAACTCCAAATGGCTCAAGACAAGGCTAAGGC 
GTGCGGGGAGGTTACAGCCTTGATCAGTTCTGTCTCCTCCAATCTCAATACCATTAAGAGTCGTCATGGTGCTAATTATC 
TAAACCTGCTGAAACAATCACCGAACCGATACCCGACTAGCGTCGGAGTTGAGATCATGTCAGGTGGCAGTCCGAACCAG 
GATTCTGGAATCGAGGTCTCTTACGGTGCCAGTCTCGGCCGTCTAACTCAATCACAACTTCAGGCGATGAATCTGCCTGC 
CAGTCTCAAACAGTTGCTCACTCAGGGAATCGGTGTGAAGCTTTCTCAGCCTGAATATTGGCCTGCTTACAACAACATAG 
CCACTGGTATTCGTTATACAACCGGAGTGGCGATAACGTTGGCCTATTGGGCCACGGTTTAGAGATGGTTGAACATTAGG 
GAGAAAAGGATAGGTGCCCAGATCAATCTGCTCGTTCATTTTTTTGCTCATATTCGCGCCCACAATGACCCAAGCTGCGA 
AAATACCAGCAAATGAGTACTCATTGGGGGATGGAAGAGGCTACATCAATATCTGGCCGGAAAAGGATGAGGCTCAGGCA 
TTTCTTATCCATAATGATGGGCCTAATGGGGCTACATGCAGCCTTAAAGGCACTCTTAGAGATAATAAAGGAGTGGTGCA 
TTCGCCGTATTCCTCTGCTTCATGTTTGCTAAGTATCACCCAGACAGGGCTGCTGTCAGTAAGCGTCAAACGTGAGGAAA 
ATTCGCCAAGCTGCTCTGCATGGTGCGGTCCTAGAGTTTGGTTTGAAGGAGCCTATAGCGTCCCGCCCAAGGGCTGCTAC 
TATATGCAAATAAGGAAAAAAACTCGACAAATGTTGGGTATGATTGAGAAAAAAGAGCTTGATGCCGCTCGCGCCTTATC 
AAATAAGCTTTTGTCAGACTGCGCAACCGAGCTAGCCTATCCTGCCAAGATATACTTGACGAACACACTTGCCATGATCA 
GTGCTGAAAAGGGAGAGAATGCTCGCTGTTTGGAGTATGCCCATCGGGTGCAAAAGCAAATTCCTGTAAGAGATGACGGC 
CAACCGGCTGAAGACTTGCTCCCGGCGGAGCACGCTTTCGCTATGGAACAACGCGCCAAGGCTGATGCTCTGTCTGAGCG 
ATGCAGCGACGAGAAATAATGACGCGTCGTAGTAGGCGCCTAGTTCACTAGGAGCGATCTTTTCCGACTCTCCCGCGGCA 
TGAGTCACTCGTTCGTGCGGAAGTGCTGGTAGAGCGTTTGCCGACTGATGTTGAATTCGCGGGCGAGCTGAGCCTTGGGC 
TCGCCGGCCGCTGCCCGCAGCCCCAGGGTAGCACCTGCTTGTCGGACAAGGCCTCTTTGCGGCCCCGGTACGCGCAGAGC 
TTGATATCCTCGCGCTACCGCGCTGGTGCTGCTTGTATGCTGCTCAGCAAGCCAGCCGCCGGTCTTTTCAGGGTAAGCGT 
ACGGCCAATACACCTTTACTTAGGTTGAGCCGTATCAGCAAAAGAATCGATTCATATGTGTGAGGTACCGCCTCATGCAC 
ATTGGCTTTGGTACCGACATCTACTCGGAGACCCTCGTGCCTTGGTGAAACACCATGTGCCAACCATCTTCGTAGGTTTT 
CCAAACAGAGCTACGCATTGATAACGACTCATCGCCATTAGCATTTTGATGACGGCATCGGTACGTTGCTAACGCGACAC 
CATCTGCAAGCACACGCAGTTTGAAATCCTCGATTGTCCTTTTGATCCAAGTCTGGGATTTTAGTCCCACGATCACCTCC 
GCTTTCGTCCAGCTTTTGCCGATAGCTCCAAATTCAACGAAGTCTTCGGTAAGCAGCTGTTTAAGCAGTCGATCATTCTT 
TCTCGATTCCGGCTCTAGCAACAGCTGTTCAAGCGCTAAAATCTCCAGGCGAATATCCATTTCCCTCACCTACCAAGTAT 
GATTTATATATGTTTCACAGGGGACAATAGATCACATAAGCTAGATCTGTACCCACTGATGCGGCAGCAACTGGCTGATG 
TCTTTCGACCGTAACGTCGGCAGGCGAGTTAGCACGTCCTTCAGGTAGGCATACGGATCATGCCCGTTCATGCGAGCGGA 
CTGGATCAGGCTCATGATAGCTGCTGCTCGTTTGCCACTGCGCAGCGATCCGGCAAATAACCAGTTCGAGCGTCCGAGCG 
CCCACGGCCGTATCTGGTTCTCCACCTGATTGTTGTCGATGGGCACAGCCCCATCGTCCAGGTAGCGCGTCAGCGCTCCC 
CAGCGTTTCAGGCTGTAGTCGAGGGCTTTAGCTGTGGCCGAGCCGTTGGGCACCAGGTCGCGCTGGGCCAACATCCAGTC 
ATGCAGTGTTTTGCTGATCGGTACCGCCATTTCCTGACGTATTCGCCAACGGTCTTCGTTGCTCATGTCCCGAGCCTGGC 
GTTCAACCTCGTACAAACCGCCAATTGAGTGCAGCGCCTGTTCGGCCAGTTGGCTTTTGTTAGCGACATGCAGGTCGAAG 
AACTTGCGGCGAGCATGAGCCATGCAGCCGATTTCAGTGATGCCTTGTTCAAAACCGGCCTTGTATCCAGCGAAGTCGTC 
GCAGACCAGCTTGCCATTCCAGTCGCCTAGGAAGTTGCGTGCATGTTCTCCGGCACGGCTTGGGCTGAAGTCGTAAACCA 
CCGCTTTGAGCGCCGAAAACGGCGTCGTGCTGTAGGCCCAGACATAGACCCGGTGGGTTTTCTTCTCGCCTGGTGCAAGC 
ATTTGCACCGGTGTTTCATCGGCGTGGATCACGTCCTGGTTCAGCACGGCTTCACGCAGTGCATCGACCAGTGGCTGAAG 
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CCGCACGCCAGTTTGTCCGACCCACTGCGCCAGGGTCGAGCGGGCAATTGGCAGCCCGGCGCGGCCAAAGATTTTTTCCT 
GTCTGTACAGCGGCAAGTGATCGGCAAACTTGGCCACCATCACGTGGGCCAACAAACCTGCGGTCGGGATGCCTTTATCA 
ATAACCTGGGCTGGCACCGGCGCCTGGATCAGGGTTTCGCACTGACGGCAGGCCCATTTGCCCCTCACATGTTGCTCGAC 
GGTAAACACGCCCGGCGTGTAATCCAGCTTCTCGCTGACGTCTTCGCCGATGCGTTGAAGTTGGCAGCCGCAGGCGCATT 
GGGT ATTTTC AGGTTCGTGGCG AATCA C CGTGCGCGGG AA CTG CGG CGGCAATGGCG AACGTTTCGGGG ATTGCCGTGGC 
TCGGCTTGTGGCGAAGCTGGAAGGAGTTGTTTCAGCTCGGCCTCGATAGCTTCAAGGTCGGTGTCGAGCAGGTCATCCAG 
CAAGCTGCCTTGCGCCGAACTGATTTGCTCGCTGCGCTTGGCAAACTTGTGGCGTTTGAGCAGAGCGATTTCGAACTTGA 
A CTG TTCGATGAGGGT TTCATTGTTGCGG ATTTTC CTGCT C AT CGC CTCGACCTGGGATTGCAACTGC AACGCCTGTGCC 
GCCAAGGCACGAAGCTGTTCCGGGGTCATCTGGTCAAGGTTGGGCGAGGAAGTCATGCCGCCGATTGTGCCAGAGCAGGC 
GCGATGCGACGACAAGTAGACCAGCCAAATGGCCGGCGTTTACAGCATGCTGATCACACCGCCTGTGCCGACGCGCTGCC 
ACGGCAGGCCCAGCACCAAGGCCTGGAGTTGTTCGCTGTCGAGTTCGACCTCGCAGCCATGGCGAATGCCGGGCCAGTGG 
AACTTGCCTTGGTTCAGTCGACGCGCGGCAAGCCAGATGCCCACGCCATCGTGCACCAGCACTTTCATTCGGTTAGCCCG 
GCGATTGGCGAACAGATAAGCGCAGTGCGGCTTCGCCGCACCGAACACCGCAATTACCCGGGCTAATGCCGTCTCGGTGC 
CGGCGCGCATGTCCATCGGTTCGGTGGCTAGCCAGATCGCATCGATGCGGATCATCGAGCGAAAGCCCGGATGAATTGCG 
CGCAGCCCTCGGGATuJGAGGCTGGCTATTTCACCGTGATCATTTGCCCGGCCATGGGCAGTTCAATGAGCACTGACGTT 
TCGGCTGGCCGTTTAGGGGTGGCCTTCAGCGGGACAAAGGCTGGTAACGAGGCTGGTACGGGCTGGTCTCGATAAAGCGT 
CAGC C ATT TGCGG ATGA CATTGG CGTTG ATGCCGTGGCTG ATGGCGA C ACTGG AC ACCG TTGC CCCAGGTTGC AGGCATT 

CCTGAACGACCTGGGCTTTGAACGGTTTCGGGTAAGAGCTTCGTTGGCGCATGGAAATCCTGGCGATAAGGGTGATCGCG 
TCCGCTTAAAAATACGCGGACACCATCGCCCTTAATGCTGGAGTTCGGAAGGTGAGTTCGCCGGACGCTTACTTTTCAGG 
AGACTACGAGTCATGTGCCGATGCAGTAATTGACTTGCCAATGAAGACAGTCAAGATTGACGCTTTAGTAATAACGGAAA 
-XTATCATGA'CATAGGTTCTAAAAGAGGCTCCATTGGAGCCTCGTATTTXTGTATGGTCGGCGTAGCACTTACTTTAGTGA 
TACATGCTTACCCATGAACTGTATTCCGTTAGATAAGGAGCTGACTT-TAAGTGAGTTCTTGTAGCCATAGAACTCCCCAA 
GACCATCCCCGTCGGGAATTTCAGTGATTCCATTTGCAATGACCGTATGGCCGCCTAAACCTAGAAGCTTGGGAGCATAG 
GGAACTGCTCCATCTGAAACATAAAGGGTGGGTGAGCGATGGTCTCTATCTTGCTTCCAAGAAAAGTACGAGTGTATAAC 
TCCTGCTAGTACCTCGCTAGCCGGATCATTCCTGGAGAAGAGTACGACATACGGCCGCCATCGACTCTCATGCTCACATG 
AAATGCCTGTGTGGAACTGTCGATTCTCTAATTTTTTTATGAGATAAAGTAAGGTTTCCTCTGAGCTTTTTTCGAGAGGA 
GGTCGGCGTATATCAACTTTAGATATTTCGCCAGAGGAGGATGGCTGCATCCATAAGGGAGGGTATGCGCCTGAAATGAG 
TTGATAGATCGCTGACTTTATGGAGAGGAAGGGGACGCAAGGGAAGTTGGAGGAGCCAAATACTTGATTACGAGACTTTG 
CATGCTCATATGCACTGATCTCTAAGTGAGTTGACCATCCTTTATTACGTATTTCCTCTTGGATGTCGTTTGGCAATAAA 
CTGACAATTGACTCGATCCAGCAACGAGCCTCGGTACTAAGCGAAACGGGAAGGTTTCGGTCTCGCAAGTCAAGAAATCG 
AAATTTATTTAGAGTTTCTTCGACTTCTAATTTTTCTATTGCGTCGAATATATGGGTTGTTTGGATTGAGGTCTCTTTTG 
TTTCGATGAGGATTAGTCCGGGTAGTTTCTTAAAGCTCATTTGCTCAAGAAAAAGGACTAAGGCTGGAAGTATAGAGTGC 
TTTGGGTTTCCAGTAAAAACGACGGGGCCATTCTCGGTATATTCAAGGTCTAAACAATATAGCGCTTCCAGTACCCCCTG 
GTGTGTATCGTTGCGGCCACGGATCACCCAGAATGGAGCGCTTCTATTCATGCCAATGTGCTGCTCCAGTTTATCGACAT 
CAGCGAGTGAGCGGAAGCCGCGAGAAACTGCTACCGCATTTTGGCAAACCTTGAGTGGATATTCTGTTCCAAACACAAGA 
AAGGAGTTCTTCTGCAAGCGTTTGGCATCGCGTTTCAACGCCGAGATGTTTCGTGAGGCGGACGCAGTGAGACTACGAAT 
AAG AGAT AAAGCC ATG TTTTCCCCTTG TGCCGTTCGTGGCA CTT A CGG AGG T AAGG ACTGGCTCGCTGTGG CACTTAGTG 
CCAATTTTGCGAGGTTCATATAGTGACAGTGCGTCCCTCTCGGGTTTCATGCGCCGCACCAGCGCGGCCAAGACTGTACC 
CCATTTATTAGATGACGCACAAATCTTGATTTTTTTATCCTGCACTTCAGTGCCTATGTGGTGAGCGTAGAGTTATTTTC 
AGCTGTCTCTGGTGCTGTTGCTAGGAGATGCAGGTAGTTCAGACAAGATGTGGAGAAGGACTCCCAGAAAAAAGGCCACC 
CGAAGGTGGCCTATGAGTTCATGCTGCGATGAGTTGTCGTGCCAACGCCATCTCCACAGAATCCCCATCCTGGTTCAACG 
AGTCGAGACCTGACTCGGGAACGTCTCCCGACGTGCTCTGAGCCACAGCGATCTTCTTGGCCATCAGCTGTAAGCAGGTG 
ATCTGCGAGCTGCCGGCGTAGCCGAAGAACACCACCCTCACCGGGTGCTTCTGCCCGATCCGCCACGACCGCCGCGCGGC 
CTGCTGCAGGGTGTACACGTTGTAGCCCGTTTGCAGGAACGCGATGGTCGGGAAGTCGAGCAAGTCCAGCCCGGTCTTCA 
CCAGCTCCGGGTTGGTGATCAGCACATCGATGCCGCGATCGACCTGGTCGAGGATCCAATCCTCGCGTCGAGCGGTATCG 
ACCGAAGCACGTAGCACTGCCACCTTCAGCCCGGATTGCTCGAGCACTTTCTTCAGCCTGGACGTGGTGTCGCGCGTCCC 
GCTGTAGACGGTGTATGCCAGAACCTTGCGGCCCTTCGCTTTCTCCTCGAAGCAGAGGTCCACCAGCACCTGCTCCTTGG 
G T AT C AA CTG CT CG TCACCG AAG AT CGCTGGCACG AAGGCCAGTGTGTCCCGGGTTCGCGG ATGCTTGACGATCTCCGGT 
CGGAAACAGCAGTCCGGCCAAGCCAGCAGCACGTTGAGGACCACGCCCAGGAGCGTGGTATCTCGTCGCGCCAGAGCCTG 
GCGGAGCTCCGCTGTCAGCGTGGCCGCCAGGCGCTGATAGGCCGAAGCCTGCTCAGGCGCCATGGGCACGTCGATGAACT 
CCTCCTGGTAGTCGGGCAGTACGTTGCCACCAATATCCTTGAGCTTCAGGAACACGGTGAACGGCAATACGAAGCGGTGG 
ATGCCCTTTGGGCCGAAGCCGGGAGCCTTCACCGTGCGTACCGAGAGCTTCTTGCCCCGCGCTGTCTTGTGCGAATCACC 
GTCGCGCTCGGTGTAGATATCCTTGAGCACACCGTGGTCGCGCATGAACGACATGGCTGCGGGAGCCATGCTGCCGCGCG 
CGTTGGGCCGATAGCCGTCCTCGATCATGCGCTGGGTGAGGATGCGGAACAGGAGATAGAACAGATCGTCGGCGTAGCCG 
CCCATGAGCGTTCCGGTCAGCAGCACGGTTTTCCGTGCCTTGGCTGCGAGAACGCCCATGGCCTGGCCCTGGGCCGAGCC 
GCTGTTCTTGTACTCATGTCCCTCGTCCAGCACCAGCAGGTCGAAGTAGCCATCAGGTAGGTAGCGCTTGATGAACTCGG 
TCGGTTGGTAGCCGCCTTCACCGAAGCCGAACTCGATGTTTGCCATCGATCGCTCCATGCGTTTGGCCTGCCGATCGCTG 
AAGACGAAGTTGCCCTTGGCGTCCATCAGGTTGATGAACTCCGAGACGTTGTCCACCAACATCGTGGCCAGGAAGTCCTC 
GCCGAAGTCGTTCAGCAGGCGCTCCGCCCTGACCGGGCCGATGGTTGGTATCCGGCACATCGACTTGAGAATCGTTGCGC 
GCCGGTTGCCGCCGTCGGGCTTGCCTGGTCGGATCAGCGTCCAGAGCGCCCCACGGCAGGAGGAACAGGTACGTCGACGG 
TCACCACGCTCGAACTCCTCCACCGTGACCAGGTTGCCTTCCAGGTCCTCGAGGACCTGTCCGCAATCCGGGCACGCAGC 
GAGCAGTTGGCCGCCGGCGGCGCGTTTCTTCCAGCAGGCGAGCCGCCAGTGGAAACCCATCCGCATCCGCACGCGGCCGA 
GGATGAAGAACTCCTGGCGCCCGTCGTAGGCGTCGCCCATCTGATCTCGCAGCTTGAGCAGCTTGAGTAGAGTATCTGGG 
CCATTGAGTACCCAGACGCGGGCGGCTGGGATGGTCTCCAGGATCTCGCGGCGCCACTTGTAGACCAGGTGCGGCGGAGA 
G ACG ACCAGGGTCCGGCG ATAGC CGGCGGCGTGCATG A CCG CTGCG AC AGCG ATGGC C ATCATGGTTTT CCCGGTGCCCA 
TCTCGGCGTTGATGATTCCGGCCTGCTCGTTACGGTCCAGCAGCAGGGCGGTGATGGCCTGGACGACCTCGGCCTGGGCC 
GCGAAGGGCTTGCGCTTGAGTCGGTCCATCACCAACTGGCGGTGAGCGTTGACGGAGCCGGTATAGACCGGGGGGTTGGA 
GCGATTGAGCGACTCCAGGAGCTCGTCGCCGAACTCGTCGATGAAGTCGGTCAGGTTGATGTTCAGGTGGGAGGCGGCGA 
GGGCGGCCGGTTGGGTCAGAGCGTTCATGTGCATCTCCTGCAGGTGAAGAACGGAGCAGGGCGTGCACGGTCCCCTTTGG 
GTATCGGCATACCCTGCAGGGTGGAGTGGAAGGAATCGCCGCGGGGGCGAGGTCAGTAATCGCTAGGGAGCAAAAGCGTG 
GTTGAGCTGCGGTCTGCCTCAGTGATGATCCAGAGCCTGGATTCGTCGCCGGCGTCGATGTCGTAGGACGACAGCAGCCG 
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ATCGCCGAACTTCAGGGCTCTCTGGTTGGTGTTCCAGTCTTCCTGGTCCAGCGTTCCCCAATCTCCCGCCAGATGGCGTT ' 

TCAGCAAAGGCGCTGGGTTGAGTTGACCGGTTTCCACCAGGTGGCTCACGGCTGCGGTCATTACGACCC^rcAGTCCTC 

ATCAGCAGCGGTGCGGGTGCC^GGGCCGGCTCA^TTGGGGCTCTTCAGCTTCTTCCGTGGTCGC^ 

ggtcagcacgccksccctgattgaccgaggagggtgtcatttcccatgccc^ 

TC S* c f c I CGG J GATGTTG ^^ 

a cgaccaggatccggcccgacttcgatcgcacgacgccagatatcgcgccggcggccag«;ccagggccag^tccca^g 
agacagctcgcggactggtgggcgcggctgcagccccgcttgcgcg^^^ 

gccgctggatttcaccggcgaactgctccggctccagggttactcggtag^ 

accacgtagggttcccacc^gccaagccgctggaatttcctcggccttttcctggcccgctccgatc^ 

cctcacctgattggcgtccgcccgggccaggtcctgccgacggaccc^atgccgaagatcaccac^ 

gatccgcggctgcgtagatgcgcaggccggtgaagtggttgctcaaccagccagtcagctcatc^ 

otaacaatcagaaccatgacgccgccgtactgcagcaacggcaggcagcgctggtagaacgctttctccag^^^ 

sccgctgccctggtactgcgacgcaccggagtggtccgccaccaggtcgcca 

accggcgcaggggtcacagatcc^catc^cccg^^^ 

tgagagggatggctcaggcttgcgcctgagcatcggtggtaagaatgctttcgcggattacctcaccgagggtgggctca 
agcacgtcgacccgcagggcgagtcgccaggcgcagacgttgccgatcgtcccg^a^cgtcaIc^^^ 
ggtgagaacctccatcaccggctcgcgccagtgctgcaggagggggagcggaca^ 

-TCTGGTGTGGATCCTCATCGCGCTGCAGGAGTGCGAAGGCGAGGTGATTGCCTCG^TCG^CGCra^ 
AACAGCCAAAGATTGAGttGGCTGCCGAACAACGTGCCGCGGAACTGACGGGTGGTGCG^TCTC^GGA^TCCTCGCT 

tgggaagacaggtaagcggcggccgtccacgatgatgtggaaatcktcc^tgccattttcttcccggS 
tggccaggaactcttgtgtcacggtgtcgcggccccaggccgaaagaaagaccagg^gcIct^^ 
gcgtcgacgtacaggtctggacactcttcgatctggtac^gcggggtgggactc^gcataggaac^ 
gccacgccgccctcaaggggcggtgaagcccctcggggtgtagtccaac^^^^ 

ctcttggtgtgaggctcgaagctgagtccatcggcttggtgcagcggcccttggccgwcaggaagacS 

ctgatcgcgtagcgcc^cagccgcttccgcctgctccaaggtcagcgacagctcctcgatga^ 

cgtcgccggcgacgtcgttgttggacagttggtcctcgagstac^ 

GGTGTTCGTGCGTCAGGCGTTTACGCCATTCCTGGTTGATCTCATCGTGAGCCAGGTCCAGGAAC^CAGCGGACAGCrc 

tagtagaacgggccggcgctttccggaatggtcttgtagccccattgcccggccgagcagtc^gcaIgS 

CATGGTGATCGAGTGACCGGCCGGCTTGCCGAAATAGAATCCGGCAAGGTGAAAAGTTCGTT^A 
CGAGGCGCAAACCAGCAACAACTCCTCGCTGAAGCCTGGGGTGTTGTAGATCGATTTCATGCOTCCTCCTTO 

cacctgttgtttgaagtcgagctcatagcccagcgcgccgagtcgcacgcactgttggcgtagtcgcttS 

TGGTGTCCAGCTTGACGATTTCGCCTAGCGGCCAGACCGTCCCGAACAGTTCTGCATCCGCGTCTG^ 

gcctgctctgcaggagccggagtgtccataccgaaaggcgcgtccctggttgcagcgaggcacS 

GGGTGACGGCGACTTTGGTGGTATCGCCGGTACTACTGTGGGGAGCTGGATCTCGGGCACTTCGT^ 

cattgccggaaagacgctctacctcctcgtcgctgaggctgtcc^tat^ 

atctcgatgactagccgaccgccggcggagtagtggtgggggcggatattggcgatgacga^tSccgcSt^ttgcc 
ttcggtgtgttgatccaggaaggcatccttgatgatgaactccc^^ 

catagcgaccgttgatgsttcgaatggccaactgtccgggaactgt^^^ 
ccgaagcgggatcgtggtgaggggatgaactactcggtaaccgggtgttcggcga^cac^ttc^ 
caggtaggaaggctcatcggac<^ctcgactatcgagagaaccagcagcgcgagcgcg^ 
tgtggtacaggtggaattggatgtgcttcgcggtcggctggagaga^^^ 

AGCCGGGTGCCCATCTCGTCGACCCTGGCAGGGTTGGCGAGATAGACCAGGCGCAGCCAGGCCGCG^S^ 

cgggagtttgaggtacgcagcagtgtgacctgcgctggagtc^ttgttgagagaggtcatgagaS" 

GGCCTCTCCCGCGGGGATGAAGTCCCCGCGTGGGTGGGTAGGA^^ 

agtgctgtggcgctcttgggtgggttagcgcttgcgcgccggagc^^^ 

acagcgagccaattcgaattcgggggctggcgggatttcccgccggccggattgctcaatcgtcatS^ 

ttgagcgtgggcgcgaatggatcaagtaccaggagccgaatgtctgcttgtcccgcgaggtgamcagatc^caa^tc 
ttcgtcgaggccgaaggcaagctgttcttgtcgaagagcggctgcggtcatctccaggatctccaga^ 

======== 

======= 

ggamctctcatagccgcgaataaacgggttgagttgagtcatgggatatctcc^ttgccggga^ 

A^CTATGGACCTCCAGGTGAGTAAGGGGCGCCGAAGCGCCCAGGTGATGTCGAGTTGTTGctG^S 
TCTTCTTTACGCCGAKCTTCTGGTATCGCTCTACCAGGC^^ 

-^ G " Cacagtcggcctgcgcc ^ gcttcgattgcat cttgaattacatagagctcgaS 
attcatcaacccoscctttgcactggatgtactcgaacctgctttgtgcagcatgcactccaatgttgaS 

AGCCTTTCTGGATATGTGTTCATGGTGAGTTCCAATTCGTAATCAAATCGGAGAGGTCACC^CC^tt 
CCCGTCTC^TTGAATTAGAAGATGCACATGGGC^ 

TAATGCCTGTTGGGCAAGTTCGCGTGAGGGGTAATATTCAACCGACTCGCGCGAGGCTGGTCCTT^ 
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CGATGTAGTAAGCACGGTTCTTATTCATAAGTACCTGCAACGGCAGTTCCAGGCTGAGATGGGCGAGCGCCAGCTTTCCG 
TATTGCATGGTGATTCTCCTTTCGGTAGGAAACACCAGTCCCATGGGGACTGAGGTCCCCTAAATGGGAAATGTTTGGCG 
ATTATGCCAGGGCAGATACCTGGGCATTGTTGTTGGCCAGGAACAGCCAGGCGAAGTTGCAGAACTGACTGCTGAGCAGC 
AGCTCGTCCTCGACGTCGCTGAGGCTAACCTCCTCAACTTCCGGCAGGGCGGGCAGTTCTACGATTTGGATGTTGCTCAT 
GATGATCTCCAGATAATGGAGAACACCACGCCCTGGGGGGAGATGGTGCCCCCCGGATGGGTATGCGATGAAGAGTGGCG 
CATAGCGTAGGTAGTGAAAGGCCCTCGGGATGAGGGCCGGTATTCAATCAAGCCGTGTTCAAGCAGCGGCTTGTTCAGCG 
TGCTCAGCGGGGGCTGCGGGCTCACCTTGAGGTTCGCCTTTGCCTTGCTGGGCTTTTTCAGCTTCTTCCTTCGCATCGTA 
GACGGTGGTGCCATCCACTTTGATCCAGGAGATGAAGAGCAGCCGGCCTTTGAGGCTTGCGTCGGGCTTGCCTTGTTTCT 
CGCCTTTCTGGTGGATGAAGGGATCCGCCCAGATATCGCCGATACGGAAGGAAATCAGAACCTTCTTCTTGGCCTCGACT 
GCTTCCTTGCAACGGCGGACAAGCTTTTCAGCCTGGGCGCCGACCACTTTGCAGTCGATGTAGGAGTATTCCACGCTGTC 
TGCCGCGC C ATGG AGGGCTG CG A CGGTT ACGGCG AGG AATGGTT C A CCTCGGCGG ATCGGT ACCTCGCGG ATGCGATTGA 
GGTAGCCGATACCGGTGGTGTGCAGGTCGAAATACTTGGCTTCTTGGGCTTGGGTGTTGTTGCTCATGATGATTCTCCAG 
TTTCGTAAGCGAAAGCGGAGAAACACCTTGCCCGTCAGGGAAAAGTATTCCCCGCATGGGTCGGTATTGGGAGGTAAATC 
AGAGGGAGAGGGCTTCCATCAACCTGTGGGTTGATTGATCGCGCTACCGGAAGCTGGTGCGATCTTGGGTTCGATCTACA 
Cr,GA_AGTCCGTGTCGTAGTCTTTCAGATCATGACTACTGGGGGAGGGTTGCCGACAACGTCAGCAGCTCACGTGGTAAAG 
GATGGGCAGGGGAAAAATGAATGTCAATACCCATGAAATGAAAAAAGCGCCCAGGGTTGGGCGCTTTCATGGTCAATGAT 
GACGGGGAAGAATCCGATGTTTTTGGCACCAGGTCTTGACCTGAGGCCAACTGAACCAGAGGGCTATGCCGGAGCAGATA 
ACCACCAGGCTGGTCATGATGATCATGCCGAACTCGGTCTGGGACATTTCACTCCTCCTCGGTGAGAGCGATACCTATGA 
GCCAGATTACCATACCCATGACGAAAGCATATGCCGTCAGCAAGCGGTGCTCGCTGAGGAAGATGCCCACCAGGCTCGCC 
..GCAACGGCAGTGGTACCGATTTTTCGGAGATCGTCTGCGAATGCAGAACGTTTCGATT.TCATGGTTTTCCTCGAAAAATG 
CCGGAAAACCACCCTGCGGGATGGTTCCCGGCAGGGTTGGGGGTGGTAGCAGTGGCTATTAGCGGCCGAGAGCAGCTACT 
TGTTCATCCGTCATGGTGGACAAGTAGTCCTGCCAGCGTTGTTGGTATTGCTCCTGAGTACGCCGATGGAGTTCCCAAGA 
GACTGCGCTACCAGGAGGTGCTGGCAAAAGAGCAATCTCATGATCGTTCAGGGACTCTAACTGGAGAGGGGGCCTCCCCA 
TTTCGCTTTCGGAGTTGGGTGCAATCAAATGGTCGAGGGAACCCATCTTGAGCTCCTTCCTGACTGGCGTTTCAGCTTTG 
GCGGTCGCCTTGGCCGGCTTGGGTTTGCGTGCGCGCTTGGGCTTTTGCACCTCGCGCGTATTGGCTAGATACCAGTCATT 
CAGGTTGACCTTCTTCAGCCGGACGATGGCTGGGATGTCGCTGATGTGTTCTAGCTCGCCAGCGGTCGGGCCGGCCAGGA 
AGGCAGGATCGATGCGCTGGACTCTCAACGCAACCGGCGCTTGTGCTTGCACTACGACCGGAGGGCGACTGAGCGCTCTG 
CGGAACTCACGCACAACACTGCCGAGGGCAAAGCCGATTTCGAAGCTGGTGGATTGCTTGGACATGGTGTTTCTCCTGCA 
GG TTG AGGG TGCGGAG AAAACACTGGCCCTGGCGGGG AAG TGGATT C CCCGCTGGGTTGCGGAAGGG AGCTTCT ATCG AC 
CGAGGCCGACCGACCGCGCTACCAGAAGCTATGCGGTCTTGGGTTGGATCAACACGGTGGGCTGTGCTGCAGCATTTCGG 
ATCCGAGCTGCATGGGGGCATGTCGCTGACAACGTCAGCAGATCACGCGTTCATCATCGAAGCGGGGGGCGAAAGTGTCA 
AGGTCAAGCAGGAGGGGGAGCGGATGCAGCGGGCGGAGAGCGAGGGCGGAAGAGCGCGGCTGGTAGAAACCCTGTCAAAG 
GTTTCCCCAGCCGTGTCTGGAAGGGGAGTCAAGTGAGCCACCTGTGGTTGAATCGTCGATCCCTGGGCATTGATCGTCTC 
GATCCCATCACCCGGCCATTATCGTGGCTTGGCCAGCAAACAGTAGGCACGCATCCGCGTACAAAGGGAGCCCTGCGTAT 
CACCGGCGGGCCACCGGCAGGGAGAAGGATCCCGATGGGTAGCCTGATAGTCCTGGAGCAGGAGCATCAGGCTACCCATG 
GAGAGGGG AAAAGG AGGGGCCGT AAC ACC AGT AC G AC C CTT AAAT CG AGG AAACACCGAACCTCTTG AGAACCCTATGTG 
CCTTTAAACTCGCTCGACACGAGTTGACGGCTGCATGACAAGCAAAAGAACAGGGAGACACAGGGAGAATTCTCAACCAG 
ATCCGCACGCCCCACCCGGTGTTGTAGCGTGCTTACGGTTTGGCGATGAGGCGAAC.CTGCATCACCAGGGCGCTACTGAT 
GACCGTCAGTAGCCAGCGGGGTGAAATCAAAGTTGATGGCATTATGTGTTCAGAAGGCCGGTTTATCAAGCTGCGCCGGG 
GTTTACTGGCCTGTGGAGGTTATTTGACCGCGACCACCGCGCCCTCGGGGATGGGCGTAGGGAGCGCAATGACAGTGGCT 
CCGAGGCGGCGCGTGACTAACTGGCCGTTGCGCCTGACGCCAACCAGGTCGGATTCCGTACCAGCATCCACGGCCGCCGC 
CAGCCCCGCCGGCGGCGAGGCGATGAAGGCGTAGCTGGTGCCGGCGGCGATGTAGCCCTG71AGCCGCGCTGGCTTGCGGA 
ACCAGGCCGGTAAACCAAGAGCGGAGTCCGCCGGCGAACCGGTGAAACCGGGGTTGGCGTGTGCGTACTCCGCCAGACTG 
GACCGGAAGAGCAGCAAGCTCCTGGCCAGGGTGTCGACCTCGGCCAGTTCGGCGCTCGAGGTCGCGTGGTCGCTTTGTAC 
ACTCAGCCAGGTCCCGGTGATGAGCGCGAGCACCAGGACGATCCACATCAGCGGCATACGGTAGTCCTTGAGTCGTGGCG 
ACCTGCTAGTTCACGCAGGTAACGGTCACGTTGGTCGAGTCGGGGGTCTTGTAGGGTTCCGGGTAGTTGAAGCCTGGGCC 
GATCTCAGTGATTAGCCCGAACGAGTAGTTGGTGCCAAATGTCTGGTTGAAGCCCACTGCGCAGAATCGGCCGCCGCAGG 
TGATCGCGTAGCGGACGTAGTCGTAGTTCACCATGGCGCTATCCTGACCATGCAACGTACACCAGTCCTTGATCACGCCG 
GCCGTGGTGTTCAGGCGCATGGAGGCGGCGGTTCGTCCCCATTTCCCGTTTTGGCAGGACAGCCAGAGTCCGGTGCTGGT 
GATGCCTGCCAGCCCGTTCGGCGAGCAGTTGGCTCCTTCGGTGGCCACGCCTTTGAGCTGTAGGTACTCGCCGACTTCCG 
TCCGGCCCTCGGCGGTGAGTTTGCCCGCTTTCATCTCGCCGCCGGTGTAGACGTTCTTGTTCATCCAGGAGCGCACCCAG 
GTGCTGTCGCTCATGTACCAGCCGCCGCCCCATTTCTCGTTGTACCAGCCCGTGTCACCACGGGTCCTGAACCAGCCTCC 
GGTGTACGTCTCGCCGGCGACGTCAGCAGTGCCGGCTTTCACCGTACCAGTGGCTGTCACGTTGCGCGCGCTGATGTCCG 
CGCTGGTGGTGATGTTGCCGCTGGCCGTGATCGCCCCGGCTGCGGCGATATTGTTGCCTCCCATGTCCAGCGTGGTATTC 
ATCCGGTTGAGTTCAGGATGACCCGGGACGGCATTGCGGTAGAGGTACTCGTTGGCGATGGCGCCGTCCTGGAAGAACAG 
CGCCGTCGCCAGATGTCCAGCGCCGGGAGCGCTACCGAAATTGCTTAAGGCCACCTGCCAGGCGGCGCCCTGGGCGATAC 
TGGTGTTGGTTTTCGAGATGTAGCCCCCGGTGGCTCCCATGAGCTGCGCGATCCGGCGGATCGAGAGTTCGGAAGCTACC 
TGTCCACCCGTGGTCACGATCAGCGTTTCGAGCTGGTTGGCCGCCGGCTTGCGGGCCAGGACCTGGTATTGCTGGCCGTA 
GATGTTGGTGTCGCGGAAGCCTGCGGGCAGGTAACGGGTGTTGCGCAGCATCGGGACGGTGATCACCGCCGGGGCCGTGG 
CGCCGGCGCTGGCCAGAACCGTGCTGAAGTTGTCCTTCAGGTACTTCTCCGCTGCCTTGGCCACCTGCTGTTGCTGCTGG 
GCCGCGTGCTGCTCGTCCAAGCCGTCCAGGTAGCTCATCAGGACCGATATGCCACCGGCGGTCGCGATGGCGATCACGAC 
GAGGGCGATCATCAGTTCGATCGAGATGAATCCACTGCTGCGCGTACTCCTCATGGTGTTATCTCCATAGGCTTTGCCGG 
GACTGGCGAGCTTGGTCGTCCTGCAGCAGATGAGCCTCCTTTCGGCGGATCTTGTCGCGCATGGACGGGCCGTCGTCGCC 
AGTGAACAGCAGTGGCTCGGCGGTCAGGCGGCGCGAGGAACCATCGCTCTCCAGCGCTTGGCAGATCACAGCTTGGATGC 
CTTCGGCAACCACGTCGTAGGCGTTTCTCGCCAGTGGCTGGGCGAGCGTCACCAAGCGCTCGATCGCCTGATGACAGCTT 
TTCGCGTGAATGGTGGCGATCACCAGGCTGCCGGTCAGAGAGGCCTTGCAGGCCTGGTAGGCGGTGTCCTCGTCGCGGAT 
CTCGCCAATCAGCACCAGGTCGGCGCCGGCCCGCAGCGTGCGCAGCAGGGCCTCGCTGTATCCGCCTGAGCGCCGTGAGG 
TTCTGACCTGGATGCAGCGGCCGAGCCCATGCTGACCGCTGAGGTTGGTTTCCTGCGGGTCCTCGACGGCGCAGCCCACC 
CCGCCCAACTCCTGCAGGCGGGCCAGGAGCAGAGAGGCGGCGGAGCTGGTCTTGCCTGTCGCCATCTCGCCGCAGAACAG 
GACCAGGCCCTGCAACTGCGGATCCATCAGTGCGGAAACCACTTCGCTCGGATAGCCGATCTCTTGGAACTCCCGCAATT 
GGGCGCTCGACCGGCGCAGCACGAAGACGCTGCCACTGAAAGCGTCTTCGAGGAGGGTGACGCGAAGCAGAACGCCGTCG 
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CCACTCGCGAGGCACTTCGACTCGGCGCGCGCCGGCGCCGGCCAGGCCTTTCACGTCGGCGAAGCCTTCACCGAGGTAGA 

GGTCGACGAACTGAGCCTGGATGATCGC^TTCACACTCACGAACTATAOSTCCAGGTAATGCTGTTGCTCT 

AGGCCTGGGTCGCGGCTGCGGTGGTCACTTCTCCGGTGATCGAGGATCCGCTGTTGATTTTGGTCTGCTCGAAGGTGTTC 

TTCGCGATCTTGGTGGCCAGCGTGATACAGGCGTCCTGGGGCAACTTGCTGGTGGTGATCGAGMGCC^TGCCGGTGGA 

^^" GTGACCGATCCGCCGTAGACGTTGTAGA CGACGCCGGAGGAGACACTCATGTTCTTCGGCACGCCGTTGATTG 

CGATCAGGCTGGGGATCAGGTTGGTACCGCTGGAGCCGTAGCCCGAAGAGGTCTTCAGGGCGCGTGCGTT^CCGCAATG 
ACGCTGATGTTGCGTTGTTCCTCGTTGGCGTTGGACGAACT^^ 

tgctcatggagcgctccttggtgac^tttcagtggttcatggattggatgagctgctgtgcctggtaggcgccgagS^ 
accaggatcatcagcgcgccgatgaagatcagggcgaagttcttcaccagcccggcggccagc^Stctgctt^ggct 
ggtctcctgccagcggcggctgaacttgaccagcgcc T cggagaagcctccccggttg<k:c^Scaca^^^ 
tggcctgccggtcgggga^tcgtgaccggcgctgcgaagggc^ 

gcagcttccaagcgctgcttcagccagggcggggagatcttgatcatgctgg^^^ 
gttgagcatgaccgccatgttcagcaggaaggtggtgccctggagcatgcggtagatggaccagggc^c^ 

CCAG f CAGACCCGGCCT ™ CAGCGGTA ^ 

AGG ™ GA ^ CCAGG I C " GTGACGAAGCTGGG ^T^ 

AGAGCAGCGCCTGCCAGATGGTGGCGCGGATCCTGGCCTGGGCCTCGACCAGGGAGACGGCATCGCCCATGGCCTGAACC 
AGGTTGCCGCTCATCTCGCCGGCCTCGATCAACGCTCGCTCCTGGGCGGGGAGATAGAGCGCCATTOCTOTCGCCAGACG 
CTTGCCGTTGGACAGCCCCATCAGCGCTTCGCGACTGGCGATGGCCACCGGATGAAACGGATCCTGCCrcT 
AGATCTTGTGCACCTCTGCCACAGCATCCTTCAACGGGACCCCGTTTTCGAGCAGGGTGGACATGCTTTTCTAGAACTGC 

aggcgttccttgcggccgaactgcttgctgtagaaggcgaactgcaactgctcccagaagcccccStca^cS 

^ GGGA ^ AGAAG " G I GGTCGAGCAGATGGTC 

catgccctcgttgatgcggcggatggcgtgggcgtgcttggtgatgccctgcatggtcttgacccagtagt^tgcct 
cggctgggccgcctttggcgaacacacgcatgaaggcgagggtgggcag^^ 

GACCAAGTCGGGCGCGAGTTGGTCTTGGTGGTCTTGGAAGCGCACTTTGCAGTGGGGGCAGAGCTTGGGCAGCAGGCTCT 

ggttgatcaggccggtcagcagggccggatcgaacagcaagccgc^tcgacgcccaggtccttca^gc?Sa^ 

CCGA ^ GCGCTGTTGGTGTGCAGGGT ^ 

gaggtcgcgtacttcgccgatcatcatgtagtccggatccaggcgcatgccgttggcgatgcccgcggccca^cct^c 

GTTCTGCGTCTGGGTCGGTGGCGTCGTAGACCAGTGGGGTCTGGTTGATGCCTTCGCCGCGAATGC^TATTrcTCCG^ 

tcctcgatggtcaggatgtgcttggatccgccatggagcttgtcgaggcot^ 
tcctgacccggtggggccggacaggatgttgatgccgtagggcatacgcatcatgcgatcgaIcI^ 

gcaggtagcc^gctgctccaggctgtcgaggccggtgtcgtcgtagagcagtcgcaggatcatcaggaacc^ 
cgcgggcgggtggcgatccgggcactgaacakttgagcttctcgaw^^ ' 
ttgcggcttgmcagtosctcggccacgtcgcacatggattc^ 
ggaactgctcgacggt^cagcaggccgtcgacgcggaaacggatcttgctgccgctccc^ 

tgcacgtcactggcgcgcagctcagtggccttacgaatgatcttgaccacgcgctcctgcatctgggtScgcgct 
atcgctatcgaccaggccgtccatggcc^cwcgcggtaaagctggt^^^ 

atcggaagccac^cgatcgagacggtcgatgaacgacagaacgtgtatgtcgttctggtggc^tccS 
gtgccgtcggcagccagagccatgatttcctg^^ 

ggtcaccatggagggctgcgcaagcgcggcaatctgaaggttcgtcatg^ggtocctactgaatgaac^ctgc^^ 
agccccgggcagcgccggcggaaccgac^cgccttgggccg™^^ 

cgggcacgcggttgccgtccttgtcggtgagcacgacctggtc^^ 

tccgcgccgctggcggcgtcaacctcgtacccgcccgggaacaagaa^^ 

GGTCCGCACCACTGGCOSAACCGGCTGCGTGATCGTC^^^ 

cgctgacgacctggccggtttcgcctttgccttccaattgc^^ 

atggcctgggcctggatctccgcaagttcgccaacggtgccagcgcaggcc^ 

cacggccatgccgatcg^tcagttcgcatagatttctcctgtgacagtccagtccaactggctgtcct^ 

(^ctgatcgacagcttgggctcgaccccttgacggtacaggtggctcgtcagcgcctgcagaacgtcgtccgccggcagt 

agcggctcatcactgccgatggcaaccttcagatcgaccttcackksccgcggtgttgccgttgtcgatgacgaa^c^ 
gcggtcggcgaacaggtgctggctggccgctgtcaggtc^^ 

ggaccctgccgtcgcacttgctggattcgaagagccagccct^^ 

gcgcgtagcatgtcctcgaggtcaggtatgagcgtccaaggcttctgcaatgacgccaggtccagc<^StScctg 

ggcgttcttctcggccagctcggcgaggcgcctctgctcctcgaggagcgcggcttgcctggcgagctctttctggt 

cattccataggtagtaggcggttagcgacccacctaccacgcaaccc^gcagotccactgccgtccactccct^ggac 

aagccgaaggtgagttgccggaggcggtagtcgcgccgcaggcgcttcggcgcgagcagttcctcgatgtcgaag^ 

gacggggaaatcgaacccttcgggaacgaagacctgtgcgtttcgcag^^^ 

cccgggcctcgtcgagggtggtgaccagatcctgcccggggacaatcgcgcc^tcgagcgtggcgaccagc^gtag^ 
tcctcgtcc^ctttccagcac^ccaggaagtcgccgtcgaactggcct^ 

gactgcgccttgcgacttcgaaacgaagccggcctggatcaccgtcg^tgaatgxk:ggatg<;caacgatgtccagatgct 
cttccttgcccagcttgcgcgcttccttcatgtactgccgc^^ 

ttgttgccgt^tagctcaggatcgagacgtctggtccacggctgccgaostcaggcttctccatggttcagcc^ 
gacgggggtgatcagcaccacgatgacctcgcgcttggtattgc^ 

cgccggtgccgaccttgttggtgtcctcggtc^tctggtcgaagccggagagtaccaaggtctcgccgct^ 

CGGGGAG fI GA J GTTGATCTTGATCTTCAGC ^ 

CGCTGGTCACGGCGCCCGGGATCAGCGAGGTGGTACTGCCGACCTGGGCGACGTTGGAGATCTGGCTGGAGGCCAGGTAG 
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CTGT CG T AGCGG CCGATCTGGAT CGGCGCCG ACTGG AGGTTG AG CGTGGTCACGG ACGGGGATCGGACGGTCG AG ACGCG 
GCCCTGCTGGGCCAGCGCCTGGACCATGGCCTTGGATCCTGCCCAGGCGCTGTTGGCGGTATCCAGGATGCTCACGGAGC 
CGGAGATCGCGCTTTGATCGATGCCCGGCATGGTGTTCTTCAGGCCGATGCCCCACTTGTTGTTGAGCGACTTGTAGACC 
AGGTTCCAGTCGATCCCCAGTTGATCCTTGTCGGTCAGGGCGACCGAGAGCACGTTGACGTTCAGCAGCACCTGCTTGGT 
GATGCTCTCGTTCTCTCGGTTGACCAACTGCTGGACACGGTTGAGGACTTCTGGACGGTCGGTGACGGTCAGGGTGCCCG 
TGGCACGCGACAGCGACATGCGTCCCATGCTCGGCGTCAGCATCGAGTTGATGCTGTTCTCGATGTCGCTGAGGATCGAT 
GTCTTCAGCTCCGAGCTGGTGGTCTGCTTGCTGCCGGAGTCGCCGCTGATGCCGGAGCTGCCATTCTGTCCGGTGGATCC 
GGAGCCGTCCCCGCTGATGCCGGCGGCCGTCGTCATACCGGAACGCACGGTGGAGTCCACCGTGTTGACGTCGTCGAAGG 
CGTACATGCGGAAGGTCCGAGTGTCCAGGTAGTAGAACTCGACCCTTTTCTCGGTTGGGTTGTAGCGCCAGGACACGCCG 
GCTCGGGCGGCGATGAGATCGAGGAACCCGCTGACTTTGCCGTTCCATTTGATGTTGTACAGAGACCGCGGACCGTAGGA 
CCCGAAGCTCGACCCCATGCTGCCGCCGGCACCGAGCGACATGCCGTTGGCGACGGAGGCAGGAAACAGCATGGTGGCCA 
TGTCCTGGCCGCCTTGGATGGGCGGCGGGGCGTTGCTCGCGCGCTGCTGAGGTTGCACGGCGAAGGCGGCCGGGTTCAGC 
GCGTCGGGCGTGATACTGACCGCCATGTGGCATTGGTTGATGACTTCCTGGGCGGCCTCCTGCAGCGACGCTGCGCCTGC 
AGGGCGCCACGTCACGATGCAGTCACTGGACAAGGTGTGCGAAACGCTTAGGGGTTTCGTGCTGACCCAGGGTTTGTCGG 
AGAACACCACCGTATCGCGCCGATCCGGTCGGGTGTTGCGCACCTGCGCTGCGATCGTAGACGCGGAATCTGCCGTAGCT 
TCGCACACGATCCGCCGACTCGTTCACCCGCGTGACCGAGCAACTGGCCAGCGCGGGGATCAGGAGGCAAGCCAAGGAGG 
TTCTTCAGGGGGGCACGCATCAGTTCTTGCGCTCCTTGATGATGATCAGGGACTGCGGGCGGCTGGCGTTCACCAGGAAG 
GGCCGTTCGGCAGCGTCATACAGGGGGAACAGCTCGGATACCGCGTCCTCGAAGGAGCCGTGGAAGGTCAGTGGAGCCTC 
GATCGGATAGTTGAGATCCTGCGGCTCCCAGCGGACGGTCCAGCGTGCGCGCTTTGCCCAAGCTTCCAAGGTGTCGCGCA 
GGGTTGATCCCGTCTCGGCTGACCAGGCCTGTGCCAGCGGCCGGCTCGGTTCCGGCGGCGTGAGCACCTTGACCGGTGGG 
-GCGGAAGCCACGGTGGGTGGAGTGGGCGGCGTGGACTTCACGGCGGAGGCCGCAGGCTTGGTCGGTGCCGAAGCCGCGGG 
-AGGAGAAGAGGGGTGGCCATCCTTGGCCGGTGCGGCTGGGGGGAGCACAGTGGTGGATTCAGGCTTCTTGGCCGGAGCGT 
GGCTGGTTGTGATCGGCGCCGGTGTCGGCACCGAGGCGACCACGATGGGCGACTCCAGCGTGCTGACTTTCTCCGTGGAG 
GAGGGTTGCGCTACCGCCGGCGGAGTTGGGGCAGCGGGCTTCGCATACAGTTGCTGGACCGGTTTCGGCCTCGGCGCCGG 
GGGAAGTTGATAGCCCGGGCGCAGCACGAAGCAGACCTGCCGCGCGACCTCGTCGACCTTAACCTGCCAGGCTGGGCCGG 
AGAGGACCTGGAGGGTGTTGCGCAGGGTCATCGGGCCGAGCTTGTACTGAGCTGCCGGCAGCGGCCGGGTGTAGAGGATG 
TTCACATGACCGGCGTCTGCCGGGCACAGCGAGTAACCCGAGCGGCTCATCACGTACTGCATGGCGTCCTTGACGCTCGG 
GTTCATGCTCGACGGGATGGTTACGTCGATGATCTGGGCCATCAGGTCGCGTTGACCGGCATCAGGCTGGGTGCTGACCA 
GGGTGTAGCGCCCATAGCGCACTACGGGCTCTTTCTCCGGCTGCACACCGTTTGGATACAGGTCCGGAGAAAGCAGGTCT 
GGATTGCGGCTGCTGTCGAAGGCTGGCCTGGCCGGCTCAGCCTTGTGGGTCGTGCAGCTGGCGAGCAGCAGGCACAGCGT 
GAGAGTGGTCAACTGCCTGGTCATTGAGGCCCCCTTTGATGATGGGGCACACAGTGGCAGGCCATCCGCCACGCATCAGC 
AGGAAATACTCGGCAACAGGGGAGGGGATTTCTCGATGACTAGGCAGAGGGTGAGCGAGGCAGGCGTAAGGGAAACGGAT 
TAGGAGCGATGAAGGTTTTTCATGTTACGCCTAAAGCGAGCCAGGTGGGAATCGTTGTTAGAAATGGATAAAATAGAAGT 
TCTAATCGTTTTAGAATGCTTTGTTGAACTCAGGTCAGACAGTACGGGGTGCTAATTGAGCTTTAAATACTATTGGGCTA 
AATTTTTCTGGGGAGCTTTCTTCTTTGTTTTAGTCGCTTGGAAAGGCTCCGTATTTCCAAGCCTGGCATCAGTTAACCCC 
TTGGTAGTGGCTGGATTCAGTACTATCCTGTTTCCTTTCTCGGTAAGGCTTGTTGAAGACTTCGCTTTAAAATATACGGA 
AAAAGAGTTCTGGGTCACAGGTTTTTTCTCCGAAACCCCTGCAAAAACAGGATTGTATGCAGTCTTTTATTTGGCTTGTT 
ATTTGTTTTCAATTCCCTTGGGGATGATTTTTTTATTCTATAAATACGGAAAGGCCTCGTAGGATTTAACGAGGCCTTGC 
TTGGTTTTAAATGGACATTACAAGGTTGTTTGCTTTTTCTAGAAGGCCTTCGTCAATCATCGCCCCGGTAACTGCCATTA 
CCAGTGCGAACCCCAGGATGCCTATAGGAGTGGCCGTTGCCGTGGCAAATGCAATACCCACAAGCCAACTGGCGGCCGCA 
CCAGCAGCTAGTGTTTCAACTTTTACAAAGAATGGTTTCCAGTCCCCGGTTTCCGTAGATATCTTGAACTCTTGGTACAG 
GCTGGCGGCGTCAATAGCTTTGCCTACAACTCCAAAGCCTTTGCTAAATTTCTCAAGGCTCTTCGCCATCATCTGCTTGT 
CTAGAGAATCAAAGGCTTTGGCAATGGCTTGCCTATCTTTAAGGCTAAGTTTTTTATTTAACGCATCCTTGTGTTTTTCA 
AACGAATTGATTGCTTCCGCCGAACTCCTGATATTTTTCCCCCTGGCGCCTTCGGCCAGTTGGTGCGCCATCTCTGATGT 
TCGTGCGCCAAATTTCTCAGTTACTTCCTTATAAAAGTCGGCAGTAAATTTAACAGCATCTTGCAACGCTTGTTCCTCGG 
CCGCCTTGCGTTCTGCTTCTGTCCTGCGTCGTGTCGTTTCGGCGTTCAGCTTATCTAAGCGGGTCTGTAGACTGGCTATC 
TCATCCACAAGTAGCTCACCATTGTAGATGGCTTTCTGCTTTTCTAGAGGGGTTGGGGCATTCTTGATAAGAGTATTGTT 
TGTGGTCTGCAGCTGCTTTTTATCAGCAATTGCTTGCGTAACCAGTCTTGTGGCACGTTCAAGCCATTGCTCTGGCGTTG 
GACTGACCAATGGAGGCAAAGCACTTCCCGCTTGATCAAGCTCGGATTGTACTTTCATCTCGAGATTCGCCTTTCTGGTT 
TTGTAGTCCCGCTCCAGTTGAGAGAGGTCCTCTTTTTCCTTTAGCGCTTTTTGCTTGTTGAGCTCCTGTCTTAGCACTTT 
TACCGCTTCAGAGAGTGAATCTTGTTTGTAGTAGGCAACCGCCCGCTCCAGTTCATCTCTAACAAGCTGCTGAAAAGGGT 
ACTTCTTTCTTTCTTCCCAAGAGTTCTTCTCCCAGATCTTGACCCAGGCCTCATAGATGGAGAGGCTTGACTGCTTTTCG 
TAGTCAATGAATGGGCCTTTGGTAAAGGTCTTTTTGTATCCTTCAAGATCAGATTCCATCTTGTGCAGGACTGCCGACAT 
GACATCGACTCGCTCTTGCCTTGTGGCCTCCCTTTTTCCTTGAGTGATCTGCTTTAGCAAGCCATCGACTTGATCAGATA 
CTTTCTTGTTGATCTCCTTCTTTTCTTTTGTTCCTTGTTCAACAATGGATCGGACTGGATAATAAATGTTTTTCTGTAGA 
TATATGTCTCTTTGAATCAAGTCGCCAAACATATTCCCGGCAACGGCCGCGCTATCGATAGGGGGAACCTCGTATGCGTA 
TTGATCGTCTGGACCTTGTGCGACAATAACCATGGAACCAGGTACTTCGTTGTCATTGGACATTTAGACTTCTCCATTGG 
TGAGTGTGGTACAGAAACAGTGCCGATTCAGCTCTAGAAATTGAACCCCATGGCTTCAATGTGTGGGCAATGCACTAAGA 
AGGCACGTCAATGCGCTGTGATATAGTGGTAATAAGCACGGTTGCTTAAACTCGATATTCAGGAATGGGATTGGCTGAGC 
CCACGGGATATTCCCTAAGAGTCATGTATCTCTAATAACGAGTTCATGCGCTCAAGATTCACGCTGGGTATGTGATGCTC 
TCTGCAAAATTCAGCTACTTGATCCGCAACACCATTCAAGGGTGAAGATAGGGATAGCAAGCGAAGAGACAGTTTTGTGC 
GTTCCATGTTAAACGGGAGATACACGTCGAAGACCTAAGCCATTAAAACGTGCTGACCATCCAGTTGGGGCCCATTAGGA 
TCTAGCACAGCACCCATGGTGCACTGGGGCTATTTCTCGGATGTACACCGTTTTGGTACAGGTCCGGAGAGAGCAGGTCT 
GGATTGCGGCTGCTGTCGAAGGCTGGCCTGGCCGGCTCGGCCTTGTGGGTCGTGCAGGTGGCGAGCAGCAGGCACAGCGT 
GAGAGTGGTCAACTGCCTGGTCATTGAGGCCCCCTTTGATGATAGGCACACAGTGGCAGGCCATTCGCCATGCTTCAGCA 
GGAAATACTCGGCGGAGGATCAGGGCTTTTTTATCCAGTTAGGCTTGCGGCTAAAAGCGCGCGGTTTCCGTCGACCTGCT 
CGGGCTCGATGAGGTCCTGGGCAACCTGGCGTTTCGCGTTGAGCAGTTGCCACAGTTGCTCGTCGATCGTGGCCTCGACC 
AGTGGGATTTTCACCACGACCATGCGGAGCTGGCCGTTTCGGTACGCGCGGTCTTCGGCTTGTTCCTGCTGACCGGGAGT 
CCAGGGCAGGCCGAGGAAAAACACGTAGTTCGCCGCAGTGAGGTTGTTGCCCGTCCCTGCGGCCGCCGTAGTGCAGATGA 
ACACTCGGCAGTCGGGATCCTGCTGGAAGCGATCTATCGCCTTCTGCCGCTTGGTGAGCGAGTCATTGCCCACCAGCGTG 
ACGCAGCCGTGTCCGGCCTGCTCGCAGAGTTCCTTCAGCGCAGCCACGGTCGGCTTGAACTCGCAGAACAGGATCACCTT 
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GTCCTCTGCGTCGAGCTCGCTCAACAGGTCCATCGCCACGCGAACTTTCACCGTTTCCAGGTAACGCCGCAGCGCGCCGA 
GTCGCGCGAAGACCGGTCGGTCCTCGAGGCGCAGCACGTCGTATTGCTGGCGTTCCTCGGTGGAGAGGGCCACCTTCAGC 
AACTGCCGCTGCTTGCCCTTGAGGCTGGGCAGCACATCTTTGCGCCTGCGCAGCATCCAGTCACCCAGCTCCGCCCGCAG 
ACTCTGGCGGAACTCCGGGTTGCCGGCGAAACGGTCGCAGAACTCTTTCAGCGGCAGTTGGCCGATGGGGTGGCCTGAGA 
GGCGCAGCAGGGTGTGCAGCTCTGTCTCGCGGTTGAGCACCGGCGTGCCGGTAAGCAGGTAGCGGTTCGGCACTTGGGCG 
GC AATGT CG AAAC CGT GCCGCGTGC ATTGCGCCGT CGGTTC CTTCATCCGCTGCGCCTCGT CG ATGACCAT C ACGGCGAA 
GCGCGAAGCGTTGGCGACGAAGGGGCTCAACTGCTCGTAGTTGACGAGGATCCACTGCGCCTCTGGGGTGTCCTGCTGGA 
TGGCCACGGTGGCCGAGGGATAGACCTCCTGGATCTCCCGCTGCCAATTGATCAGCAGGGTAGCCAGGGTGATGACCAGG 
ATTGGTCTGCCCGCCGCGCGGATCGAAGCGGCGATGACCGCCTGGCGGGTCTTGCCCAATCCCATGTCGTCGGCCAATAA 
GGCGCTGGTTCTCTGCAGCAGATGAGCGATGCCGTCAGGCTGGTGGGCCAGTAGAGAGTAGCCCTGAAGCGCCGCCTCGA 
TATCAGCCGAGCTGTACTCGGTGCGCTCGATCTCCGGCACCGCGGCGAGGTAGATGTCAGAGGCTGACTCGTCCTCCAGG 
GATGGCGCTGCCGGTTCCTGCTGTGGACCGCCGATGCTCATGCTGGGCAGCTCGGTCGACGGCGCGAAGCTGCCGTCGCT 
GAGCAGCTCCTGGACAGTATCCAGAATCTCGAATTGTTCCTCGGCGAGGCCAAGCTCCAGAATCAGGTTGCTTCGGAGCA 
GCTCGGGAGTGGTGTCGATGCGCCAGGAACTCGATGGGCGCAGGAATACCCCGCGCATCCGCCGAGCCACCGCCACAACA 
CCAGGATGGAACTCGCCTGACAGTAGGACGCCGCCCTCGGCCAGGGGCGCGATGCGCAGCAGCATCCCCCAGGTGAAGGC 
CTGTCGGTCAAGCGAGGCCTGCGCGGATTCGACCAAGGAGCTGAACTCCCGCCAGGACTGGGAGTGAAGGCCTCCAGCTA 
GCTCGGCGAGACGGTTGTACAGAGTGTCCAGTTCAGGCAGCAGCTTGCCTTTGGGGACCCGCCAGTAGCGGTGCAACGGA 
TGCTCCCGGGCGCTGATGAAGAAGCCGGACAGCTCCAGCAGCGCGCGATTGCTGCCTGCGATGTAGGGGATCTTCAAGCC 
G AAGT CGT AGC CGTCG AAG AT AGCC CGCTGC AG AAGCGTCG AAGGTGT CGCGG AGGT ACG ATT AACCAGGCGTGG A CGGT 
TCATCGGTCTTCCAACTGAGTGAGCAGCCACTTCGCCGTGGCTTCGGCGAGGCCCGCCTTCCAGCGACCATTGGCGAAGC 
GCGTCATGTGGAGGGTCATGTCGGGCCAGTTCAGGTTCGTCGCTATGGCTCCAGAGCGAATTCGTTCCATGAACCAGCGG 
TATGCCTGGCCGGCAGGGGAGTGGGTCGGCACGGGGGCATTATCGAATTCGCGCAGTTGGACCGTGGAAAGGTGCTTGGG 
ACTCCAGGGCCTACCTATAGGCTCGCCATGCTCACGCCTGAGCTCCAATTCGGCAGAGGCCAGCGGACGACCAGCCTCAC 
GTACTGAGAGCAGTCGGCAGTCTCCTCGGTAGGCGGCCTGGTCGTAGCTATCGATGCAATGTGCCAGAGCGATATGTTCG 
GCATATAGGTCGGCAGGACAGCGGAGCTCGACGATCTGCAGTCCATTAGGGCAATCGATAAGACCGACAGACAGCATTCC 
CTCCCAGGTGGTGTACTCGTCGGTGTCGACCAGATCGGGCAGGCTGCTCTGCACTGCGGCACGCAGCTCCCTCTGGACCT 
GGTGGTAGTCGTCCACCAGGTTGCCAGCTTGTCGGTAGGTACAATGACCCAGATACGCTTTCAGGGCGCTGCGCGCCTGC 
CGAACAACCTGTGAGTTCCCCTGGTCGAGGGCGGTATAGAACTCCTTGATGTCTCTCAGCCGTGCAGTGATTGCAAGCCA 
TGCCGGATCCTGCCAATCCGACGGGCAGCCGTTGAAGAGACGGTTGTAGTCGGGCTTGGCGTTGTGAACTTGCCAAGGGA 
TCGCATTGTAGGCGGCATAGAATGCTGTCCATTGAGCGCGAGTGATCGGCCGCCGGTTACCTAGCGATGCCGCCAGTAGG 
AGAGCATGCCATCCCGAATGTCGGCCTTCCCGTTTGCGATGGAAGAGAGCTCCTCCCGCACGGCCGGGACTGACGTGACT 
AAGAAATCG AATCG AGGCCCG CGG CGCCTG AAAT AG C C AGGCG AAAAAG TTG ATG ACCG AAATTCCTTCGTCCGCG ACCT 
GCCCAAGAATACGGCCCATATCGAGGTACAAGTCGCATGGCATGGCGGCGGCCTGCGGTCGTTCACTGTCAAGCTGGGGA 
AAGGGGCAAGGAAGGTAATGGCGGATGCTCTCGCCGTTGTCGTTGGTCGTCGTGTGAGGCCAGACTCCCTCCTCGCAATC 
CACCAGTAGCGGCGTCAAGACCGGCTGAGCCTTCAGCATTTGCAGCCGGCGCACCTGGTCGCCTTGAGCCAGCCAGTTGT 
AGAGCTGGGGCGAGAGGCACCGAACCGAGCGGATGGCGAACAGCAGGTCTTGGTCCAGGACCTGTCTGAAGGTTCGCAAG 
ACCGCTCCGATCTGCTTCAAGTAGAACTGGTGCCTTGGCGCGTAGTAGCGCTTGTTGCTGTGGCTCGGATACACGAGGCG 
ATGTTGAAGTCCGAGATCGTCCCTGGCCATCTGTGTGGCTAGGTCGTGCGCTCGCTTTGTGTATGAGCTGAGTACAGCAA 
CTGCGTCGTCGAACTTTTGAAGGCGCTGTAGCAGCTTCAAGGCCCCTGCTCGAGACGGCGTGCTTGCCGGGATACGATGG 
GCAGGCTTGCACTCCCACGCACCAATTAGTTGACCGTCGAATACCTGGAGTAGCGGCCCTGCACGCATGATGGTGTAACG 
CACACCGTTGTCAGCAGCGACTGTGATGCACCAGCACCTGGTCGTAGATGTGCTGCGCAGGTAGTGACGGATGAACTTGG 
TTCGAGCTGTCCCTGCGAGGGAGAAATGTGTTGCCCAGGTGTTGGCGACGGCGAGCTGCCGCTCGCTGTAGCGGGCCAGG 
GATGGATCCTGTAGCGGCCCGCTAGGGGGGGCGTTGTCGAGAGGTGCCACGCGCATGCTCCTGTGTAAGCAAGCAGCACT 
TTGGCCGGTCGGTTTACGCTGAATCCATGTGAAATGGTCATCTCAAGCGTGTTGGATTTCAAGGTTCTACAAGCTAGCTG 
ACTGAAGGTCTTATTGGGATGGATCAGTAAATTCTCTTGTAGTCAGCCTCAAAGTCTCCTAGTGCAGCAAACTCGTCAAG 
GATAGCTTTACTTAAAAGAATAAGATGTATTGTCCTCGCTGGGTTGTCCCAGTTGGCACCACCATGCTTGCCTCCGTGAA 
ATAGATTGTTTCGCACGGTCTTAAGTAAGCGAGCGACTCTAGCTAAGTCGCTTTTGTCCTCATCTAATTTAACCGGACGC 
CAACCCAGCTCTCTACCGGGCAGGACTATTTGTTGCTCTGGACTCTGCTCGATTAGTGCTGTGGCTGATTGGGAAAGAGA 
GTATTTGTCAGAATGGTTTTGTACAAAATTTTCCCATCCCGGCTCTGCCCTAGCTCCAGGTTTGTAATTTTTTAAGTAGC 
CATTCTCTTTGAGGCTGAACTCAAATCGCGAGAACCAATAGAAAAAATCAAAAGCTCGCTCTGTGATTTCGGCAGATATA 
TTCTCTTTTCGCATAAACCCTCCTTGATAGCTAGCAGCGAATTGGGCCTGTACCATTTTGCCTGCACTGCGTGACTAACT 
ATAGTGGCTATAGCCCGCGCGGGTTGTCGAAGTCAGGCCGGCCGCTCTTGGTGGGGTAGCTAACCTTGCACCCTGTTGTT 
CGATAGCCGCTGCAACCCCAGAAGTCGAAGCCGCCTTTGCCGGCTTTCTTGCGGTGAACCAGGGGCAGACCGCACTTGCC 
GCAAAGGAAGTCGCTGGTCGGCGCCGAGGAGATACTGGCGGTGGAGCTTGCGCGTGGTTCGTCCTGTGCGCTGGGGAGCG 
CGCGGAGTGTCTGCAGCTCTGTCTGTAGTAGCTGGTAGAACTGCTGGACCACGTTCATGTAGGTGTCCTCGCTGTTGGCG 
ATCCGGTCCAAGGCGACCTCGAGGTTGCGGGTGAAGTCGAGGCTGAGGAAGCTGAAGTTGTGCTCCAGGAGCGCGATGGT 
GGCCTCTCCCAGCTCGCCGGGGATCAGGCTTCGCCCCTTCATCTGCACCTGGCCCTTGTCGATGATGTTCTTCACGATCG 
AGGCGTAGGAGGATGGCCGACCAATCCCGCGGCGCTTCATCTCGCCCACCAGGCTGGCGTCGGTGAATCGCTTGGGAGGG 
GTGGTTTTCTTCTCCAGGACCTCGCCGCTGTAGACCTTGAGTATCTGGCGTGGCTCCAGCGCCGGGATCGGGATGGGGTT 
TGAAGGCGTTTCGTTCTTCTGCTCCTCGGCGTCATCACCCTGCAGCAGTTTTCTCCAGCCAGGCACGTTCAACAGCTTCC 
CTTTGGCGCCGAAGCGCAGCGGCTTTTTGTCGGGGCCGACGCCCAGGAGGGTGATGGTTCTCACTGCGTACACGGCAGCT 
TCGATCTGACTGGCAAGCGCGCGGACTCGAATGAGCTGGTACAGCGCCTGCTCATCAGCAGTTTCACCGGCGGCAGCGGC 
CATCCAGTCGGTAGGGGTGATGGCGGGGTGGCCTTCCTGGGCGTCTTGGTCCGCTTTGAACATCCGCTGTTGCTCAACAC 
ACTTCAGCCCCAAGGCTTTGGCGACAGCACGGATATCCGGCATCGAGTCCTTCGAGATATTGGGGTTGTCCGTCCGGTGG 
TAGGTGATGAGCCCCTGTTCATACAGCCGCTGGGCGACCTTCATCGTCTTGTCGGGTGACCACTTCAGCGCATTCCCGGC 
GGCCATCTGGAGCGAGGAGGAGATGAACGGTGCCGGAGGCGCATCGGTTTCTTCGCTATCAATGCAGGTCTCGACGATGA 
CATTACGTATAGCCCCCACCAGTTCTGCGAGTTGACGATCCTGGACATACGGGAACTCCTCGCTGGCAAACACGGGCACT 
GGATCCCATTCCGCCGTCCAGGTACGGCCGTCGCTGGGCGAAACGAAGGTCAGTTCCACCCCGAAGTGCTTGATTGCTGT 
GAAGGCGCGGATCTCCCGCTCTCGCAGGACCACCAGGTACACCGCGACGGACTGCACGCGCCCGGCGGTGGTCGGCCTAC 
CCATCACGCGCCGCAACTCTGGCGTGACCAGATACCCCACCAGGCGATCGATGACGCGACGGCATTCCTGCGAGGCGACC 
TTCGGGAGGTCCAGGCGACGCGGCGAGCTGAGTTCGGCGGTGATGCATGACTTTGTGATTTCTTTGAAGGCAACGCGCTT 
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GTAGTTCTTGATCCCGGCAGCTTGGAGGATGTGCCAGCCAATGCTTTCGCCTTCGCGATCCGGGTCCGATGCGACGTAGA 
TTT C C AC GGCTTT CTGC CG C AGCT C CTTCAGCCGTGCGA CGG TTTTTTCCTTGCCCG AG AGGATCTGGTAGTGCGGTTTG 
AAATCCTGGCCCACGCCGACGGTGAGCATCTGCGGATCCTGCCCGTGAACGGGTAGGTCTCTGATGTGGCCGGCTGTCGC 
C TCG ACCTGC C AGGT C ACGTCGGG ACG AAT CG AGGGGAG AAGGG ACCT CAGCTTTTTC AGCTTGC CCGGGGCTTC AATGA 
TTACGATGTGCATCGGCGGAATCTCTCGGAACAGTCGGTCGAACTGTCGCGATCAGAGAGCGGGCCAACAATGCGAAATA 
GACGGTGCGTTGTTGCAGTTTTCGGGGCAACTTTCCGCGCATCGGCGAAAAAATATCCGTTGCCGATCAAATCGGACACA 
ACTTACCGATTCCTCAAATGCGGTAATCTTGTCCGTGTGGCTGCTGCAACCATCTACGGTAATCGTGGCAACTGGGTCGT 
CGCT CG CA T AG ATTC AG ACAAGG AAATG ATCGTTCCTGG AAGG AC ATC AATCGAC CCC AATC ATCC CG ATTGGTCGT AGG 
AAGG AAAATGGTATGGATC AAAGCCTTTGCA C ATGC ATGCCAA CGC CAAT CGTC AACC C C AAGG AGCTGCGACTGTGCCA 
CATGTTAGTCGGTAGAACTTTCCCGATAACATTGATCGCAGGCGACCATTGGTTGAGCTATGACGGCAGCGCCTGGTGGG 
TCGATGCGGATGAGCCCGCGACGGAGGACGAGGTGGCGGCTCTGTTGGTCAAGGCTGGTGGTGTCACTACGTGCTGGTGC 
GGATAGGAAAGCGCATCAGGTACACCCAAGCCCCTCCTAACTCGTGGGGCCAAGGCCGTAGGTGAGCCACGACTTACTTT 
GCCCACAATGCGAAATGGACGGTGCGCTGTTGCAGTCTTCGGGAGGAGCTGAAGGGTAAAGTCATCAGAAGTCGAGCTGC 
TGTTGCTGCTCCTGGGCAAACCCTTTCAACCAGGACGCCCGCATCGCAGGATGACTGTAGGGGCAGTCGAACACCCGTAG 
CCCTTTCCGCGCGGCCTCAGCACCAATAGCCCGTACAGTCCGGTGGGGATCGTTGAAGTGTCGTCTGTCCAGCTTGTCGG 
TATCGGTCGCTCTCTTTGACACACTGCACCTCGTACTGGTCGAGATTTCCGATTCGGAAGCCCTTGCCACCAGAGACTTC 
ACGAACGGGCGGCCAAAATTAAGGATGCTGGTCTATCAGCTTTTGCCGCCATAACTGTTCATTGCGGTTGCTCCAGTCGC 
TGGAATGCTTGTCGAGGCTGCGGGCGGCGAAGAGAGGTTCTTGCCACCTGCAACTTCACGAGTGGGCGCGAAAAATTAAG 
ACCGGCGTTTCACCGTCGCGTCGGCAATGCCTTGGGCCATCCTGATGAAGGATGCGCGATCTTCTTCCTCCATAAGCGCC 
ATGCACTCTAGAACTGATATCTCGTCGGCGCTGAGGAGCTGCCTGGTCAACGGCTCATCGCGATCCCGAAATGAGCAATG 
■TGAGCCTGGTTGGTCATCT-TTGCTGTTCATGGCGGAGTT.TCCTGCGGGGGGCT.TTGTCGAAGCTGCGGGTGGCAAAAAGT 
GGTACT-TGCCACCCGCAGCTTCACGAATGGGCGGCCAAAAATTAAGCCGCCTGCTACATGGGGATGTCGTCGTCGAAGGC 
GCTGTCGTAGTCCTGCTGCGATTGCGCATCTTGCTGAGCAGGTTGCTGCGGGACGTTCCGAGACTGTTGATGCTGGGTTG 
GCAACAGGGTGACCTCGGCCAGGCGATGCGGAAGGATGGAAATGCGCGAGGCTTCGACCTTCAGCGCCTGGACTTCTTCG 
CCTGACTCTTTGTCCGGCCAGCGGTCCATAATGGCCCTGCCTTCGACCTTGACGCGCATACCTTTCGTGAACAGTTCGGC 
GAAGCGCTGGGCATCCTGATGCCACCATTCGACGCTGCACCAGAAGCCGCCGCGATCCTTGTAGCCACCTTGGCCATCGG 
GAATCGAGTTGTCGAACATCACGTTCAGTCGCAGCAACTGCCGCGGGGGATTGTTGCCGTTGGGGAAGCTGCGGTGCTCC 
GGCGCCGAGCCAATGTTGGCTTCCCAGAAGACGGGGGTAGCCATGATGAATCTCCAGTCAGTGGTGGTTGGTGCTCTGCT 
GCAGTCGGCCCATGAACTCGGCCTCGGCCTGGCCCATCTTTTCCGCGCACTCGGCGGCCTGCTTGCCGATGGAGTGGATC 
AGGCTGATCTGCATGTTGAGGGTGATGCGCTGCAGCTCCATCTCGTAGAGGTCTTGCAGCAGGTATTCCGGGGTCTTGTC 
CTGGCG CAGC ATTTCCGT C CACA CCCCGACGCCCAT CCGC CGGG ATGCAATGTGCTGCC AGCGG AG AAAGGTTGTGCGGG 
CGGATGTGCGCTGCTCGATGAGTCGGGTGGGCAGTAGAGTGAAGGGTGGGCGACGAACCTGGGCCAGCAGTCCCTGGGCA 
AGTTCAAGCAATCCCTGCTCCATGGCTTCGCACTGTTCCGCCAACTGGGCCAGCTCCCTATTACCTTTAAAAGGCTTTAA 
AAGGCCTTTTAGTGAGGCACTGTGTTGCAGCGCCTGGAAGGCAGTTTGTTGCAGTGGGCGGAAGGCACTTTGTTCCAGAG 
CGCCTTGCTGGTCCTTATCGAGCTGCCGCATTCCGATCAGTCCT CG AGTTCGACACGGTCAGCGTCATCATCGGCATCAC 
CGTCAGAGCGGCCCCGGGTGATCGGCGGAGCGAAGTTCGAGCGTCGAGTGCCCTCCAGGATGTCCTGCGGGATCTCGCCG 
AACTTCTCGTACATCTTCCGCGCGGCTTCGGCGCGAGCATTGTTTGCGGCGAAGTCGTCGCGAGTGGCGCCCGAGAACTG 
GTAGCTCTGGGCCAGACCAAACAGGCTTCGCAGCACCGACGCACCTTCGTCGAGCCAGACCTCCATGTCGCGGCGACCGA 
CCAGCCCGACGTGCTGGGCCAGCAGGATTCGACGGGCGAGTTCGTCGTAGTTGGTCAGGAGATAGACTGCCTTGAAGCCG 
AGAGGGTTGGAGATGAACAACGGGACCTTGACCGGTGTAACGGACAGGTTCTCGGAGATATCGAGGGTCGCGGGCAGCTT 
GGCCATGACGTCATCCAGTCGCTGGTCGATGGCCTCGAGTGCGGCTTGGCAGCTCTCGACTTTCTCTTCGATCTGAACCA 
GCCACCAGTCGGAGTACGGATCGTCCTGTGCTGCCCCGCGGTGCATGCGATTCACGTATGCGCAGAAGCCGGAGAGGCCG 
AGGATGCTGTGCTTGGCATCCGACTTCTGCCGGCCGGTCCAGATGCGGGCGGCGTGATAGGTGTGAAGCGTCAGAGCGAC 
ATCGCTGCGGAGAGAGCCGATCTGAAGCCGATGGGTTTCAGCCATGGTGCTGTGTTCCTTTATAGGGAAGGGGGTCGGCC 
ACATCGTGCTGATCGGGGTAGGAGCGCGCAGTAGGAAATACATGGTTCGCCGCTGGGGAATTTCCCATGGCGACTCGGAG 
CAAGGCACCCAGTTCGCCGCCGGCGTCGAAGTCAGTTCCTAATGGAGTGGTTCCAGTGGAACCACCCAACTCGCCGTCGG 
CGCCGAAGTCAGTTCCTGGTGGAGTGGTTCCAGTGGAAACACTCATCTTGCCGTTGGCGCCGACTTGGGTATCTCTACCG 
TTGGTAATCTGGTTGGCAGTTCATGTAGTTGGCTTTGTTGCATAAAGCTCCCCACATCGCTTTTTCCGCCATTTCAAAGC 
GATACATCTCAACCGTAATGGTCAACACGGTTACTGAATACCCGAACACGATCAGCATTAACAACAGCCTGAGTTCTCGG 
TCGTTCCACGCATCTGTAAAAAGTGCAGAAAGAACGAGAACGACCAAAAAGAAAGGGGCCATGCTCAAGAACATTGTTTG 
AGTTCCTCGTGAGTGGTGATGCCTCGCTTTGGCGAAGTGTTTCGGCCCGAAACACCCATCTTGCCGCCGATACCGAAGTG 
GATCCCTGGCAGAGTGATTCCAGTGGAATCACCCAGCTCGCCGGCGGCGCGGAGGTCGGCTCCTGGTGGAGTGATTCCAG 
TGGAATCACCCAGCTCGCCGGCGGCGCGGAGGTCAGGACCTGGGAGAGTGATGCGGGCCGCATCACCAAAAAACACGACG 
CCCGACCGAGACTTTCTCATGAATCATCACCCCGCTCGGATGGCACGCTCGATCCGTGCCTGGGCCGAATGAGGTCCTTG 
AGCCGGCTGAGGACCTCGCTCCCTGTCCGGGTATCCCCCCGGGGCTGGACCTGAGGCTGTGTAGGCGCTATGGGGCTTGA 
TGGTGCCGGAGCACGAATGGGGCGTTCCGTTGCGGGGATGGTTCGCTCAGCCGGTTCTTCCGGAGCCCAAGAAGCGTTGA 
ACTTGCCCTGGACTGCACGCTGGATGAGCGTCATGAGGTAGCCGAACGGATTGCTGACTGTTCCGCCGGCACAACGGTGC 
TGCCACTGCTCGATGATGGGTAGCCGAAGATCCTCGGACACCCGTCTGAGCGCGCTTACTGCCTGGATCCGTTGCTCGGC 
CTCCAGTGCGTGCAGGAGATCCTGCCAGTTCGGATGGGCTTCCTCGCGTGCTCGCGGTTGTACTGGCTTTTTACATACAG 
AAT CTTGTTTGT ATG T AT ACGT ACT AT ATG AGTTCGGC ATC CG AACC AAGGGCGT ACTCGGCGG AT*T CTGTCGGGGTTCG 
GTATCCGAACTCAGCGGTAGAGCGCATTGCTTGCCGCTGATTTCACTGAGTTCGGCATCCGAACTCGGGGTGGAATGAGG 
TAAATCCGACAGAGTTCGGATGCCGAACTCTGCCGCTGGATCCGCTTCGAGCTGACTATCGATAGCCCATTGCTGGTGGT 
TCAAGCGTCCCTCGAGCTTCTCCAGATGGGTAGGGAGGCGGCGACCCACGTCCGGATCATCCCTGAACTCCCGCCAGATG 
ATCTGCCCGATTTCGCGTATGGCTCGGTTGCCGTGACCGGTGGATTGGCTCAGCAACTGCATGTAGTCGGTGTCCAGCTC 
CAAGGCTTCGGCTGGAGAGACAGGCTCGTCGTGAAGGATGTAAACGTTGCCCTGGACCTGTCCGTTGAGGTCGTTGCGCA 
GGCGTCGGCCGAGGCTGAGCCAGCGGGTCAGACGAAGGACCGTGAGTGCCTTGGCGATGGTTTCGCGCGAGGCGATCTTG 
CCCGGCTGCATACCGAGATAGGGGCGCAGTTGCTCGTATGTCGGGAACGCGGTCAGTCCGTCGTCGTTGATGAGCAAGCG 
GAAAACTTGCCAGCAGTTCCGCTCCAGCGGAGTAAGACGCTTATCCAGGAGGAGGCGCCGTGGAACGGTTTCATGAGGAT 
TGCCGCTGAAAATGATGCCGGAATATTGTTGAGCCAGCGGTTCGCCGCTTTCGGCAGCAAGCCGCTTCTTATACAAATGG 
GCCTCGAGATGCCCCGAGGCCGAGTCCAGCAGTGTCGAAATGGGAAAGCGCGACAGCCTCATGGTGCTGTGTCGCCTTCC 
GACTGAACCAGGTTCAGTAGTTGTTGACGTTCTGTCTCAGCGTTCACCTGCGTTGTTGCGGATGGGGCAGAGTGGGGCAG 
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GTGTGAGCTGACGCTAGCGAGGTAAAGAGTGGATTGGGACTTTTGCAGGCCCGCCTGGCTCTGGCTCGATTTGCCAGACG 
GATAGAGCCCGTCTTTCAACCAGCTCTGGATGAGAGACCAAACAATGGCGAGGCTGATGAATTCTGGACTGTCTTGTTCG 
ATTCCGGCGTTGATTTCCTCTGTGAGGATCATCGCGATGTCCAGCAACTCCATGCTGTCTTCAAGGCGGACCTGATTTTC 
CTGCATCAGGTGCTCCCAGCGGAACCATATCTGGCGCTCGATCTGTTCATCGAGCTGCCGCCAGCGACCGCGGCGCGGGT 
GGATCTCCAGCATCAAGCGCTTGGTGGCGGTGTCCTCCGGCGACAGACCGAAAAAGCTCTGGATCATCGTTGTGGTGGCT 
CCCAGCTTCAGTGCGCGTTCGATCTGCAGGTCTTCCTGCGCGCTGCGCTCGGCTGTGGCCAGGATTTTCTCCATGACGTC 
GATGTTCACTCTGACATCTACCCAGGAAACCGGGGTATTGGTCAGGATGCTCATGACGGCAGGCTGTTGGAGCTGAGCCA 
GAATCTCCGGCCGCAGCCCCATCTCGGCACAACGCCTGAGCTGGCCGTTGCGCAGGTGGTTGAGCACCTGCGTCAGGACC 
GCTTCATTGATCGTCGACTTCGACATGTCGATCCCTCCCCAAATCGTGCGAAATCACTGGTCAGTAGCTGCTTGCTCGGA 
GGCGGCCGGCGGCAGCTCAAGGTCAACCAGGCGTCGGGCAAGACGAATGATCCGGAACAGTTTCACCAGGGCTTCGTCAT 
TGAGACGGCCTGCCTGGCAGGATGGCTTGTCACTTGTGGTGGGTTGGCCCAGCAGCAACTGAGCGAGGCCAGCTGAGAAC 
TGGAATTCACCCAGTGCATCGCGGCAGGTCTGTTGATCGTGAGTGTTCAGAACGACTTGGACCTGCCCGCTGACCGCTTG 
AAGCAGCGTCATGAGGTGCAGCGCTGAATCAGTAATCTCATGGTCTTCATTGGGCTCGCGATACGTGTAACCCACACCGC 
CCTGAATCCTGACGATCTCGCCTGGGGCGCCGACATGCAGAGCGATCTCTTCAGCCAGATCAGCGATGTGCTGTCGCAGG 
ATCTCGGGGGTATCGATCGACCGCTCGATGTACCAGAGATCGGTGATCGGGTGCAGGCCACCGACTTGCACTGGGATTGC 
CTGCAGAGCTGTTTCATCGAAGACAGGTAGATGTTCCCCCTCGAGGCCGGCCAGCCGTTGGCGAGTCTGCTGGATCTTAG 
TCGATACCGGGCTCACGATATGGCCTGCCAAGACCGCCGCACGCTCCTCATCGGGCAATTGCTTTTGTTGGACAGGTGGT 
GG CGGAG C CGG CGGGGGG AGGGG ATT ACCCGGG TTGCT C CTTGTCCTGGACGTTTGGGG AT CTGCGGCGGGGCTCTGTGT 
TTGTGCTTGTCCAGTAGACGACGCAGCAGGGTTTTCCTGCCCAATTGGTGGGAGTTGAGGTGGTGCGGCAGGTGTGGGCA 
GGTCGACCAGAGTGCCGCGGCGTTGCTCCTGCTGCCCGTTGGTAATCTCGAGCAGGATTTGGTCATAACGCAGGCCCAGG 
.GGGCGCTTCATTTGGTCGATGAGTTCGTCCTGGAAACGCTCGAAAATGAATT.CGXCAGGGCTACTGTCGAAGATTGCCAG 
GGTGTCCTGGAACAGCATTTCGAAGTCAACCCCTTCACCAGCGTATAGACGTGCCCAGCAGGAGGATGCCGACTTTCTGA 
GTGACAGGAGTTTCTCGATTTGCGGTTTACCGAGCCCTGAATACAGCATCACAGGAATCGCCGGCGCCAATACCTCAATA 
GTGTCTAACATCTTACTGATATGGGATTGAGATACAGGGTAGCCGTCCGCTTTTAGCCGACGTGCCAACTCGCGCTGGGA 
AATGCTTTCACCGCCGTTCTCTTGTTCGTAAAGAAATTTCGCCTTCTGCACCCCAACCGCGCGCTCGATGAACTTGAGGT 
CGCCCTTCAGATCGTTCTCGGCCAAATGACCGGTCAGCGCGATGATTTCGCCGCGCTGCTTGTCCCAGGGCTTGAACAGG 
CAGTCGAAGCTGAAATAGCGCTCGTCTCCGGTCTCTTTGTAGAGGTCGTTGAGAATTTCCAGGCGAGTGTTTCCGCCGTT 
GCGAATGCGGTATTTGTCCTCTCCAGGGCGTCGAGTCACTGGTGGCGGCGTATCGAGGCCTCGATGTCGGATCGATTCTT 
TCAGCTCATCGTATTTCGGGTTTCGAGTGGTCCGGGGGTTATCGTGCCAGGGGAGGACGTCGTGGAGTGTCAGCCTCATG 
GGCGTGTCGCTGATCGGGTCCGACAACTGCTCCAGAACCTTGGGCGGTGCGGTAAAACCAGGGGCAAGTAGCCGGTCTTT 
CAGATCTTGCGGGTTGATCTTGGCCATACTCATGCCTCCGCCACCTGAGGCGTGTTCGCCGCCGCCAGCTCAGCCTGCAG 
GCGCTGGTTCTTGGCGAAGAGTTCGTCGACCATTCGGTTTGCTCGCGCGAGTTCTTCTTCGAGCGCGTTGCTGCGCTCCA 
GTCGGTAGACGATCTCGCTGTTGGCCGAACGGTGGTTATCCGCGGCTTTTTGCTTTAGCTGATCGCGCAAGCCCAAGGGC 
ATACGGACAACGAACTTATCTTCGGTTTCTCGGGCCATGGTCATTCTCCAATTTGGGCCTTGCGCCGCTTGAGCTGGTCG 
CGGAGTTGAGTGGGGAGGCTGCTGATCTGCAGCGTGTCGCGCTCCTCGTCGTACTCGATCCTGGAGCCCAGCAGGTGCGC 
CTCGAAGCTGATCGACAGGCCTTCGGCGCGGCCGGTGAAGCGGCGGAACTGGTTGAGGGTGCGCTTGTCCGCCGGGATTT 
CCGGCGACAGGCCGTAGTCCTTGTTACGGATGTAGTCGTAGAACGCCCGCGGTTGCTGGTCGTCCATCAGTTCCGAAAGC 
GCGTCGAGGG T CAT CGGCTCGCCG ATGCGCGCCTG CG AGG TGGCGTAGTCG ACC AGCGTCT CGGT CTTCTCGCGGGCCTG 
TTCCTCGGCCATGTCCTCGCTTTCCACGAAATCGCTGAAGGCTTTCAGCAGGGTGCGCGTCTCGCTCGGCGAATCCACCC 
CTTCCTGGCAGCCGATGAAGTCGCGGAAATAGTCGGAGACCTTCCTCCCGCCCTTGCCCTTGATGAACGAGATGTACTGC 
TTCG ACTGC TTG T TGTTG CG CCATTCGG AAATGTTG ATCCGCGCGGCCAGGTGCAACTGGCCG AGGTCCAGGTGGCGCGA 
CGGGGTGACCTCCAGCGACTCGTTCACCGCCACGCCTTCGCTGTGGTGCAGCAGGGCGATCGCCAGGTAGTCGGTCATGC 
CTTGCTGGTAGTGGGCGAACAGGACGTGGCCGCCGGTGAAGAGATTGGACTCCTCCATCAGCTTTTGCAGGTGCTCGACC 
GCTTCGCGGCTGAAGCCGACGAAGTCGCGGTCGCCCTCCAGGTACTCGCCGAGCCAGCCGCTGAACGGGTAGGCCCCGGA 
CTCGCCCTGGAAGAAGCCCCAGGCCTTGTTCTTGGCGTTGTAGCTTTCGTTGAGGTCGGCCAGCAGGTTCTCGATGGCCT 
GGGAGTCGCCCAGCTCGGCGTCGCGCGCGTGGAGCACGGCCGGGGTGCCGTCGGGCTTCTTCTCGATCAGGTGGACGATG 
GCGTGTTTGATAGGCATCAGAAAGACTCCTGGTTCAGGGAACGCTGCCTGTCGGCTCGACGGAGCAGGGCGCGGGCATTG 
AGCTCGCCGCGTGCGTCGCTTGGCGTGGAAGAAATGAAGATCGAGGCGCAGCCGATCTTGCTGAGCTTCAAGTGGCCGCC 
TGCGGAGCGGTCGACGCTCCAGCCTTCTGCTTGAGCGAAGGCCACCAGCCGGCGAACGCTTTCCCGGGCGCCGCGAAGGA 
AACTACGCATTCGTGGTCTCCTCGCTGTTGAGCTGCAGCCAGGTCTCCATCTCTGCAAGGGAGCGGGGAAGGCTATCGGT 
GGCGGCGGCATCCAGGAGACGTTTCACCAGCGCCTGAAGGCTGCGACGATCGCTCTGCCGGCTGCAGGCCAGGTCGTCAT 
GCGCGGAACGCAGCATGCGGACCTCTGTGCGCAGTTGAGTGCATACGTCGTCAGTTGCTACTGGTGCCGTTGACGCGACA 
GGTGACCGCGACCGTGTCCTGAGGCGGTTTTCCCACTCTTCCCTTGCTGCCGAGAAGCGATGGCCACGACCTACACTCCA 
ACCGCTGGTCCAGTAGCGCGACTTTCCGCAGTCAGGGCAGGGCTGTGCGCCGCTGCCATCCAGGTGATAGCCGCAGTTCA 
GACAGATCCACGAAGGACCTTCACGCAACTTGAGCTCAAGCATCAGCTCGACTTTCTGGAAGTGCTCGAGCGGCCGGTAG 
GCTGTGACGTCAGGCATCGCGCGCTACCTCCTTGGGATTTCTACAGGACAGCGCCCAGCGGCTCTCGATGCGCTTTTCGT 
GGTCGCCTGCCTGTACCGGCATACGGGTGATGTATGCGTTGTCCTCGCCAAACAGTCGTTCGGCAAGGCGGGAGGCTGCC 
ACCTTGTCGCTCCAGGTGCAGGTCGCACGCTGCTTCTGAATGGTATTGGTCCAGAAACCGGCGCCATCGAACTTGACGAT 
GCACTGATAGGTCTGGGACGCCTTCATGACTCGGCCTCCTTGCTGAGTGCATCACCGACCTGAACGGCCACCAGTTCGGG 
C AACTCCGGCTC ATGCCCGGG AT AGTTGT TGGCC AACTGATGC AGG AACTCGCGCGCGGCGT C CTTGCTGC CTGC AATGC 
GGAACACGGCGGTGAGGTGCTGGGAGTCCTGAGTGCCGCAGCAGAGGTGCGGCACCAGGCTCACTTTCAGCGAGCAGCTG 
AACAGGTGTCGCGGGATCCATACCGACCAGAAGTAAGCCGAAAGCAATTGCCCCGGAGCGCGCGTAGTGACCACCAGAGT 
C CCGGCCTGGG TGG TCG AG ATTTCGATGT ACC C CAGTTCCTCCAGCTCACG AAGCTCTCGCT C CATCG AACTGGGTG CGA 
CCGTGCTCTCGTCCCGCGAAAGGCGCGAGATCACTTGGCTGAGGGCGTCCAGCTCGATGGGGTTGACCCCAACGTGATTG 
GCATGAACCACTGCCGAGTCGGTGAACGCGGCCGAGGCGTGGATACTCGGAAAGCGGGCTTCAGTGTTCATCAAAGACAG 
CCTCCAACTGATGGACGTCGTAGCTTCCGCAGCAGGGGCAGACCAACACCTCGGCTTCTCCTTCTGGGGTGTTCTCGGCG 
GGGAGAACCTGGCGCACCAAATGGTGTGGGTGGTGCTGGTCACAATTCATGCACTGGTGGGTGACCAGGGGCTGGGAAGT 
GTTCATGCATGGGCTCCTTGCCAGGTCCCCGGCGCCAGGTCGGCAAAGCGGGTCTGGTTGGCGATGAAGGCGGTGTGGAC 
GGTGCCGATCGGACCGTTGCGATACTTGCCGATGATGATTTCGGCGATGCCCTTGGCCTCGGTGTTGGGGTGGTAGACCT 
CGTCGCGGTACACGAACATGATCACGTCCGCGTCCTGCTCGATTGCGCCGCTCTCTCGTAGGTCCGCGCAGTTCGGTCGC 
TTGTTCGTCCGGTTCTCTAGGCCGCGATTTAGCTGGGATAGAGCTACGACGGGACAGTCCATCTCCTTGGCCAGCGCTTT 
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CAGCGAGCGGGAGATTTCCGAGATCTCGGTGGCTCGATTCTCCAGGCCTGGGCAACTCATCAGTTGCAGGTAGTCGACCA 
ATATCAGCGCGGGGTGTCCGTACTTCCTGGCCGCCCGGCGAACCTTGGCGCGCAGCTCTGTCGGCGTGAGGTTGCCCTGA 
TCGTTGATGACCAGCCGGCTGCCATAGTCGTTTATGCGCTGGATCGCGCCAGACAGTCTGGGCCAATCCTCTTCTTGCAG 
TTGGCCCTTCATCAGCTTGCCCAGGTCCAGGTGGCCGAACAGGGCGGCAAGCCTGAACAGCAACTGCTCTGCCGGCATCT 
CCATGCTGTACACCTGAACAGACTTCTGTTGGTCGCTCTGGAGCGCGGTGTCGACCAGGTTGAGGGCAAACGACGTTTTG 
CCCATCGCGGGGCGGGCACCGACGATGATGAGATCCGACTTCTGTAGTCCGCCGGTGAGTGCGTCGAGATCCTTCAGGCC 
AGTCGGGACCCCCGTTACCGTCACGTTGTTGTTGAAGCGGTAATCGATGGTGTCGACGATCTTCGTGAGACTCTTGTTGA 
TATCGACGAAATCGGCGTTGTGGTGGTCCTGGCCAAGGGCCGAACAGCTTCTGCTCAATCTCCTCCTGAACCTCAGAGGG 
ATTTG CCTGG TGGTTCG AGGCGGTC CTGGTGCAG TGG TGCCCGAG AG AC ATCAGCTGCCGCAGGTGTGCCCG ATCGCGAA 
CGATCTGCGCGTAAGCCTCGATATTGGCCACGGAGGGCGTGTTGTCGGCGAGCTGGCCGAGGTAGGCCAGCCCGCCAGCT 
TCTGG AAGGT CTTCGATCG CTTCCG A CA CAGTC A CGA C ATCAAACGG AGCATCCTTCGCGGCCAA CTCGCTGATGGCGGT 
GAAGATCAGCCGATGCTCATGCCGGAAGAAGTCCTCCTTCTGCAACTGATCGCCGACAATGTCCCATGCCGCGTTGTCCA 
GCATCAGGCCGCCGAGTACCCCTTGCTCCGCCTCGACGGAGTGCGGCGGCGTAAGTTCCAGAACACTCACTTCGAGTCCT 
CCCGGCTGATCTGCAGAAACGTGCTT.CCTCTGCCGCCCGGGGAGCCGATGATCTCCAGCGCTTTCACTTCCACCTTGGCG 
GACTCGATGATCACCGAACCGACATGAGCCACCGCTTTCGCGCGCTCGATATCCAGGGGCTTGTCTTCGTCAAGCAACCC 
TTCGATGGTGGCAAAGAGGTGATTCCGCAGATCCTCAATCTTGTTCTTCATGCTCGACCTCCTGAATGGTGCGTTTGAGC 
TTGCTGATGACGCGGATTGCGCTCTTCAGCTCAGGTGGATAGCGATGGATGGAGTTGCGGCGCATGCGTTCGGCCCGGGT 
GATGAGCTCCAGGTTGTCGATGACGACGTTCTGCTTGTTGTTGTCCTTGAAGCAGACGCAATGGCCGGTTGGGATGGGGC 
CGAAGTGTTCTTCCCAGAGCAGGATGTGGATGCCCTTCCAGTCCCGGGGGGGATAGCCGGTATCCGAGATCTTTCGTTGC 
AGGTAGCCATCAGCGCTGACCCGCGTGCTGCCGACTGGGAGCCATGTGTGGGGCTTTTGCCCCTTCTTGAACTGCGTTTC 
^ TGGTGCGCGTCCTCGCGCGGGCAGCCCCTTCATGCCGCAGTTCCATGGGGTGTTTC 
XCCAGTCCTGCCGGATCTTCTTGGAGAACTCAGGGCTTTTTTT^ 

AGCGTTTTGTTCAACCTGGCTGCCAGGACCTCGTTCGGCATATCCGGATACAGGCGCAGCAGCACCAGTTCCTGTTCGGC 
GGTCCAGGTTTCGCGGATGCAGCGCTCCTCTCGGTTGCTGAATTTGCGGCGGGCACGCACCAGGGCCTCGTAGGCGATGG 
TCG AG AGAG CAG AGCG AGTCATTGGG AAC CTCCCTG CT CG AGCGCGCCCTG AACTCGCCGGCTGACACCGCGGTGCAGGC 
GAGTGTTTTTGCCGTGCTCCCAGCCTGCTTGCAGCGAGGCCTGGTCATAGGCAAGGGCCTTCGTAAGCGCCGCCGGCTCC 
AGGTACTTCAACGCCGGATGGTGTAGTTCGAGGTAGGCCTTGATGGCTGCTTGAGTCGACTCGTCCGGCCTGCCAGCAAA 
TTCACGTACCAGCGAACGCACTGCGAGAAGCCAGCCTTCGACGAAGAGCTTGCTGCGACGACGCTTGGTCGACAGCTTGC 
AGCGCTTCTGTTGGGCGACATGAGCCGAGCGCGCTGCCACCAGTTGGTGGTGGAGCGTAGAGTAGGCGTGAGCGGCCAGC 
TCAGGGCCGATCCCTCGGCCCATGAACTTGAACGTCCAGCCCGCTGGCATCGCGTATGCCGCCAGGTAGTCGCAGCCGAA 
GG AACTGGCG CAG AT CC CGGCC AGTT CATGCAGCC ATGGGG CAGG AG AGCGCCTGGTTCGAGTGGGAACGCACGCGGTTT 
CCACCGTGTGGGCCGAGATCTCCGCATCGCTGAGGTCGAACTGCCGCTTCAGGATCGCGGCCTGGCGCAATGCGGTTTCC 
GCCTCGTTCGGCGTGGCCCCCTTGGCATTCGCCAGGGCCATCAGCTTCTCGACCTTGGCGAGGACTTTTTGCTTGTCCAT 
TACGCGCCCCCCTTGCAGATGGTTGAGCCCTGCTCGGTGGCGGAGCGGTCCAGGCGCTCGATCTCGGCCAGCACTAGGGC 
GCCGGCGCGTACCAGGTTCTCGCGCGCCGACTTAGGCTTCCACCACTCTTCGTCCCAGGGCCAGAACAGTGCAGAGCCAT 
TCCGAATGCCGCCCGGCCAGTCGGTGCCGATGCCGCCGGCATGGAGCGCGTAGCAGCCGGCCGCCAGGGCGATCAGGCCA 
GCGCTAGCGTCGTCGTTGCTGGTATCGAAACCTTCGACGGTGATCTGGCGCCGGCGCTCGGCCTGTACATCGAGCCAGGC 
CTGCGGAACAGGGGAGAGGTTCGAAGTTGGGTAAATTCGGTCTGTGGCCTGTCTTACCCGATCGACAATAGCCAGGGCTG 
CTACGTCGCCCAGTGCGACCTCCAGGGCGGCGAGCAGGTTGGCCATCAAGAACTGCTCACGCTCGTCCGCGTGGTTTTTT 
GCGCTTGGGCGTCTATTCCAGTCGATCTGTGCTGAATACCTCGCTGTTACCGGGTCTGGCTCGCAGTAGGACGGCCCGAT 
GTGAACCTGGCAGTCCAGGCATTCAATGGCACAGTTCCAGGCATCGGTGTCGAGCAGCTCGGGTTCCAGCATTCGAATGT 
TCGTGCTGCCGCAGAGCGGGCAGGGCTTGAGCTTGGGGTCACTCATGGCCTGCCTCCCTGCCCGTCGTCGGTGCTGGGCT 
TAGCTTGGGAGGAGCCGGCGACGAGCAGGTCGGCCAGGTCCGGATAGTTCGGATGGGTCCAGAAGCCGAACTGATCGCGC 
TCGACCAGGTGGGCCTGAATCATAGGTTTCATGCCAGGCCTCCCACGCTCATGCCGAGCACCGCATTTGCCTCGGCGGCC 
AGTTGCTGGCCTTCAGTGCTTGATGAGCTGCCGACGAAGAGCATCAGCCGTTCGATGACCCTCACAGTGTCCTGCGAGGG 
TTGGGCGAACTCGGCCAGGTGATCGATGGCAGTAGCTATGTTCCAGATCGCCCTGTTCATGTCCCGCTGGATCACTTGGA 
TCGAACTGGCATTGGCCGCGATGGCCAGTTGCGATTTCGTCTGCAACAGTAGGTCGTGGGCGATTTGGAGCAGTGCGTTC 
TGCAGGGACTGTCCACAGGCACGCTGCTGAGCGCGAACCGGAGCGACCAGGCCCTGCAACATCGCAGCACCTTCAGCCTC 
G ATTTG CAG ATTCTCGGAAG C AC C GGCGGCAGCGC CTGCTTTGGCCATGTAGC CTTGCACGTAGGCGT CGAACTGGGTCT 
CGGCGATCAAGCTCAGTAGGAGGTTGTTGCGGTTCTGGAGGTTCATATGAGGCCCTCCGGAGCCGATTGCAGAGCGTCTG 
TCCAGAGCCGAAGGGTTTCCTGCTCCTCGCTGGTCAGAGCGATCTGTGCGGTGCAGTCAGCGACCTTCTGCTTGATGACG 
CGCAGGCGCTCCTCGACTTCACTGCCATCGTCGTACGCGATTTCGTTAGTCAGATGGCTAATGAAGTCACATCGCAGGCT 
CCCCACCTGACTAGCGAAATACTCGGCTTCACTGCGCCATCCGATGAGCTGCTTTGGACGCTCCAGCCAGCGTTTGAGTT 
TTGAGTGGAGCTGGTGCGTGGAGTACAGGTGATCGAACACCCTGGCCCGGTTGTAGATCAACGCGATCTGCAGGATTTCC 
AGCCGATCTTCGAGCGTATTTCCCGGAAGCGGGTAGTTCGAGGCGACCAGTTGCTTGGCCAGGCGCTCACGTAGCTCGCG 
CCGCTTCTTGGCTTTTTTCTCGTAGCGATCACGCTTTTCCTTCGCATGCTCGATCTGCTTGTTCACCGAGTCGAGCAAGC 
GCGCTGCGTCTTGTAGGAGCTGCTTTTGTTCGAAGCTCAAGTAGCAACCCAGCTTCCCGAACAGGTCGATACCGCCACCA 
GTCAGCTTGCGTATTTCTCGCGTAGCGCCGGTCAATTTGGCCTGCTGGCTCCGAAGGTCGGCGGCGCTGCCGAACTTGTT 
CAGTTCCAGCACAAGTTCCGCGTCGACGTCGCGGGCTTGGGTGATAGGAGTCTTCGCCATGTCAGCGCCCTCCCTTGACC 
AGTGCTGCAACCGCCTCCCGGCGTCAGCGCCAGGAAGCGGTCAGTCCACTCGGGAAAGACCTCGATGGCCAGGTTTCGAA 
TGATTTCCAGCGCGGGCGCTGATGTGCGATTGGAGGGTTGCCGCGTTTCGAGGCGGTGCGCTGGTAGCCCGCGCGATGCT 
GCGTTGCGAAACACGACGGCATCCGGGACAGTCGTTTCGAGCACAGAAATGTCCTGATGCTCATCGAAGATGGCACGCAC 
ATTCTCGTGAATTGCCCGGGAGTCATTGGTCTGGTCCAGGCAGTTGATGACGATCTGAACATTGGGGATCCGCATGCCGA 
GACGCTCATAGGGGCGTAGGCCGTCGAGCATTTGCATGGTGCCGCGGTTGAACTCACGGGCGGTAAGCATGTTGGGTTGG 
AGGGGGGAAACAACCAGGTCCGATGCAAGCACAACCATTTCGAGCAAAGCTGAGCGCGCACCCTGGGTGTCGATCAGCAC 
CAGGTCGTAGCCTTCTTTCAGAGCGGGCATCAGGTTCGCCAGGCGTAGCCGGCCATCGGGCGCCTGGAGCAGTAGGTTGT 
TGAGCTGATTGTTCTGGTCGTTGGAAATCACGACGTCCAGATTGGGGATAATCGTCCTGGAGATGATCCTCGCCGGGTCC 
GTTATGTTGGCGGCGAGCAGGTCGTAAATGCCGCCCTGGGCAACTTCCGGCAGCTCGTAGTACGAGGATAGGGAGGGCTG 
GACGGGGTCCAGATCGATGAGGAGGGTTCGTATGCCTGCATCGGCGCAAAATGCACCGAGGTTGGCGGCGGTGGTGGATT 
TTCCTACACCACCCTTGGTGGATACAACCGAAGTCGCTTTCGCATTCATGTCTACAGCCTCTAATGTGGGGTCATTAGAG 
GCTGCTAAACAGACTCGTTGTAACGGGAGTGGAAACCCTTCCTAGAGAAACCTACGACCAGTTGTCTTTTAACCAATTGG 
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FIGURE 3 
RL024 

DNA sequence (SEQ ID NO: 3) 

GTGGCGCTGACCGGTAATCCCCTCCTGAAATTGCTGGTCGTCCCCGTCGTGATCGGCGCCATCCTGATCGGCGTGAGCAT 
GATGGGCAAGAAAGAAAGTGCGCAGTCACAAGGCGCCGCAACCCCGACGGTAACGTCGGAAGAAGCGGCAACCCTGGGCA 
TCGACGGCGACACGCCCGCCGACACACTACGCACCATCGTGGCGGAAAGCCGGCAGCTCAAGGACCAGATCAGCAAGGTG 
ATCCAGGAGAATGACTCGCTCAAAGCCGCCAATGAGAACCTGCAGGGCCGCCTGCGCAACATCGATCAGAACATCGAGCA 
GAAGCTCAACAACACCGCCCAGGAACTGCAGCAACAGCAGGAAAACCGTAGCCAGACGATCCTGGACCAGGTACAGAAAC 
GGCTCGAGAACCTAACCCACATTCCCGAGGCCGGTGACACCGACCTGCCCGTAGGATTCGGCGTGCGACCAAAGGATGGC 
CAGCACTTTCAGGGAGCGGGCTCGTCTTCATCGGATATCGTCTGGATCGAGCCCCAGGACGCCCGCGCGGTTGATGCCAA 
TGGCCAGCCGCTGGCCGCCGGCTCCACCACCCAACCGAGCGGATTCAGCTTCCCGACCTCCTTCGGCAATGCGGTCGATC 
GCGGACAGAACGCGCTGGAGCGGATCGATGACGGGCTGCACCCCGTCGGCCAACAGCGATCTGACCTGGAAAACCGCAAG 
CTCGTCCGTAAGACCTACACGCTGCCGCAGAACTCGACGCTCATGGGCTCGGTGGCCATGTTTGCGCTGATCGGTCGTGT 
GCCGGTCGACGGGACGGTCAATGATCCTTACCCGTTCAAAATCCTCATCGGCCCGGACAACCTCACCGCCAACGGCATCG 
AGCTGCCGGACGTCGCCGGCGCGGTAGCCAGCGGGACCGCCTCGGGCGACTGGACACTCTCCTGCGTGCGTGGGCAGATC 
CG CAGC CTCACG TTCGTGTTC AACG ACGGG ACCG TG CGCA CCTTCC CGG CG CCGGCCGAGGAGGTG AATG AC AACC AG AG 
CAACAACAACCAGACCGCCAGCGCCGACCAGAAAACCATCCAGGGCGGCCTCGGCTGGATCAGCGACCCCTACGGCATCC 
CATGCATCGCCGGTGATCGCCGATCCAATGCCAAGGAGTACCTGGGCAATCAGAGCCTACTCACGGCTGCCGGGGCCGGC 
ATTGCCAAGCTCCTGGACGCCGACGAGAACAACACCAGTACCGTCTTCAGCGGCAACGGCACCAGCTTCGGGACGACCGG 
AACCAACAGCAACTCGGCCCTCAACAGCATCCTCTCCGGCGGCGTCAGCGACATCCGGCAGTGGATGAACAAGTTGTACG 
GGGAGGCCTTCGCCGCCGTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTGGCGATCGACTAT 
GAACTCAAGGGCCGCAAGGTCGATTACAGCTCTGGAGCCGCTCATGCAACAGCAGACTTGGACTAA 

Protein sequence (SEQ ID NO: 127) 

VALTGN PLLKLLWPWI GAI LI GVSMMGKKES AQSQGAAT PTVTSEEAATLGIDGDTPADTLRTI VAESRQLKDQI SKV 
IQENDSLKAANENLQGRLRNIDQNIEQKLNNTAQELQQQQENRSQTILDQVQKRLENLTHIPEAGDTDLPVGFGVRPKDG 
QHFQGAGSSSSDIVWIEPQDARAVDANGQPLAAGSTTQPSGFSFPTSFGNAVDRGQNALERIDDGLHPVGQQRSDLENRK 
LVRKTYTLPQNSTLMGSVAMFALIGRVPVIX3TVNDPYPFKILIGPDNLTANGIELPDVAGAVASGTASGDWTLSCVRGQI 
RSLTFVFNDGTVRTFPAPAEEVNDNQSNNNQTASADQKTI QGGLGWI SDP YGI PCI AGDRR SNAKE YLGNQSLLTAAG AG 
I AKLLD ADENNT S T VF S GNGT S FGTTGTN SN S ALN S I LS GG V S D 1 RQV3MN KLYGE AF AAVYVQ PG ARVAVHLDQQliAI DY 
ELKGRKVDYSSGAAHATADLD . 
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FIGURE 4 
RL025 

DNA sequence (SEQ ID NO: 4) 

ATGATCCGGAAGTCGACAGGCTCGCTCTTGCTAATGCTTGCCCTACCCACACTGGCCCACGCGGTGGAGATTCTGCGCTG 
GGAGCGCATTCCGTTGGCCATTCCATTGACGGTCGGCCAGGAACGCATTGTTTTCGTCGACAGAAACGTGCGAGTTGGGG 
TTCCTCGGGATCTGCAGGGCAAGCTGCGCGTCCAGAGTACCGGCGGCGCACTCTACCTGCTCGCCAACGAGCCGATTCCT 
CCAGCGCGCCTGCGCCTACAGGACGCGACCAATGGCGAGCAGATGCTCATCGATATCGCCGCCACCGAAGCAACGGCCGA 
CCAACAGCCGCGCGAGCCGGTCAGGATCGTCGCCGGCGAGCCAGTGGATCCGCATTATGGCCAGTCCCGGGAAGCCCAGC 
CATCGGCAGCAGCGAAACAGACCGAGCACGCAGAAGCACCGAAGGCCGTGCCGCGCGAAACGCCCGTCCCCGTGGTTCTG 
ACGCGCTATGCGGCGCAGATGCTCTATGCCCCGCTTCGCACGGTGGAACCGGTGGATGGCGTCGGTCAGGTGCGCGTCAA 
GCGACAGCTCGACCTGACCACCCTGCTCCCCAGCCTACCCATCACGGCTACCGCCTTGGGCGCCTGGCGGCTGGACGACT 
ACTACATCACGGCGGTGAAGCTGCAGAACGCCAGCGCCCAGCACCTGGCCCTGGATCCCAGGGACCTGATGGGCAATTTC 
GTCGCCGCGACCTTCCAGCACCCGTACTTGGGGCCCCGGGGCGACGCCTCCGACACCACTACCGTGTATCTGGTGACGCG 
CGGCCGCGGCCTTGCCGACGCGCTCCTGCCCTCCTCCATCAGCCAGATCGATCCCAAAGGAGGCCGTCGTGGCGCTGACC 
GGTAA 



Protein sequence (SEQ ID NO: 128) 

MIRKSTGSLLLMI^LPTLAHAVEILRWERIPLAIPLTVGQERIVFVDRNVRVGVPRDLCGKLR 

PARLRLQDATNGEQMLIDIAATEATADQQPREPVRIVAGEPVDPHYGQSREAQPSAAAKQTEHAEAPKAVPRETPVPWL 
TRYAAQMLYAPLRTVE PVDG VGQVRVKRQLDLTTLLPS LP I TATALGAWRLDD YYI TAVKLQNASAQHLALDPRDLMGNF 
VAATFQHPYLGPRGDASDTTTVYLVTRGRGLADALLPSSISQIDPKGGRRGADR. 
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FIGURE 5 

RL026 : DNA sequence (SEQ ID NO: 5) 

ATGAGTTTCAGAAAACACACTGCGCAACAGCAGGCACACATCAACACCTTCCGGTTCATCACCGGCTTCCTGTGCATGGT 
CATCGTTGTGCTGGCCTACTGCGTCTGGGAAGCCCGTAAGGACCTCTGGATCCACATTCCGCCCGACTTGCGCTCAGGAA 
GCACCCGGTTGTGGTGGGACATTCCGCCAGAGAGCGTCTATGCGTTCGGCCTCTACATCTTCCAGCAGGTGCAGCGTTGG 
CCCAAGGACGGCGAGGTGGACTACAAGGGAAACCTGTTCCGCTACGCTGCCTACCTCACTCCCTCCTGCAAAGTCTTCCT 
GG AGAAAG ACTTTG AG TTTCGT CG T AACGC CGGCG AG CTC AGGGGTCGCGAG CGCACCACCTCGG AAATCC C CGG TCGAG 
GCATTGGCGAGAGCAATGGCCGCGTGATCCAGCACTCGATCAATGACTGGACCGTCAACTTGGACATGGACAGCACGGAG 
T ATTACGC CGG CGAG AAG ATC AAG CGGG CG CTGG C CCG CT AC C CGTTG C ACGTTATC CG CGCCG ACGTCG ACCCGGAAAC 
CAATCCCTTCGGCCTGCAGTGGGACTGCTACTCCGACACGCCTCAACGTATCGAGCTTGAGGAGCCGGCCGCCCCCACCA 
AGCGGGAGGGAGGTCTATGA 

Protein se^ence (SEQ ID >70: 12S) 

MSFRKHTAQQQAHINTFRFITGFLCMVIVVLAYCVWEARKDLWIHIPPDLRSGSTRLWWDIPPESVYAFGLYIFQQVQRW 
PKDGEVDYKGNLFRYAAYLTPSCKVFLEKDFEFRRNAGELRGRERTTSEIPGRGIGESNGRVIQHSINDWTVNLDMDSTE 
YY AG EK I KRALAR Y PLHV I RAD VD P ETN P FG LQWDC YSDT PQR I ELEE P AAPTKR EGGL . 
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FIGURE 6 
RL027 

DNA sequence: (SEQ ID NO: 6) 

ATGCCCGAAGAACATCTGTTTCAGGATGGAACCCTCAGCTTCCTGCCGACCC 
GTTTGAACCGGCAACCGGTAGTCATCGG 

CGGCCTGACCGCAGACGAAATGTGGATCACGGTCTTCACCAGCGGAGCAGCC 
GGGTTCGTTCTTGGCATCCCGGCTGCCT 

TGGTCGCAGGTAACGCTGCCTGCATTCCACTGGGCGCGCTGCTGGTCGGCGC 
CCTCGGCCTAGGTATCGGCAGCCGCGTC 

CTGCGGCGGATGAAGCGGGGGCGGCCCGATACCTGGTTCTACCGCCAGGTGG 
AGATGGCCCTCTCGCTGCGCTTTCCCGT 

jCTTCGG CAACCGT CGCCTGGTTACGCGCTCCGGCGCCTGGACCAGTCGACGC 
ACGGAGTCCCCATGA 

Protein sequence: (SEQ ID NO: 130) 

MPEEHLFQDGTLSFLPTRLNRQPVVIGGLTADEMWITVFTSGAAGFVLGIPAALV 
AGNAACPLGALLVGALGLGIGSRV 

LRPvMKRGPJ'DTWFYRQVEMALSLPvFPVFGNRPvLVTRSGAWTSRRTESP. 
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RL028 

DNA sequence: (SEQ ID NO: 7) 

ATGCTG AAACT C A C CCTCCAG AAACTGTCCGCCCTCTG CCAG AGCCTGG C CGCCATCACTTTGGCGCTCCCCGGTATCGC 
CTTGGCTGCACTCCCCAAACCCGAGGCACCTAGCCGTGGGGAGGGATCGGGCATCATGCAAACCATCCAGAACTTCGGCT 
ATGACGGAGCGATGCTCCTCGCGCTGCTCATCTGCGCGGCTGTCTTTCTGGGGGTCGCTTGGCATACCTACGGCACCTAT 
CACGCCATCCATGACGGGAAGAAGAAGTGGTCGGATCTCGGAGCGGGCGTAGCCGTAGGTGTCGGCCTGCTGATCTTGAT 
CATTTATCTCGTCACCAAAGCCACCGCCATCATGTAA 



Protein sequence: (SEQ ID NO: 131) 

MLKLTXjQKLSALCQSLAAITLALPGIALAALPKPEAPSRGEGS 
HAI HDGKKKWSDLGAGVAVGVGLLILI I YLVTKATAIM 
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FIGURE 8 

RL029 

DNA sequence: (SEQ ID NO: 8) 

ATGAGCATGAGCGGAGCCCAGACATCAGCGTTCCAGGCCGCCGCTGGCTTTCCCCCATCGGCCGGCGAGGGACTGTTCAT 
TGG AG CAG CG ATGACCTTCCTTCTGCTGTGGTC CG C CTGGGCG ATGT AC AG C ACCTGG CGCGG CTGGGCC ACC AAC AACC 
TTCGACAGCGCCACCGGTGGCGCTTCCGCGATCCCGGATCTTGGTCCTCCTCGGCATCACCTCTTTCTTCCTCCTCAGCT 
GACCCATACGGAGACACTCATGCTGAAACTCACCCTCCAGAAACTGTCCGCCCTCTGCCAGAGCCTGGCCGCCATCACTT 
TGGCGCTCCCCGGTATCGCCTTGGCTGCACTCCCCAAACCCGAGGCACCTAG 



Protein sequence: (SEQ ID NO: 132) 

MSMSGAQTSAFQAAAGFPPSAGEGLFIGAAMTFLLLWSAWAMYSTWRGWATNNLRQRHRWRFRDPGSWSSSASPLSSSSA 
DPYGDTHAETHPPETVRPLPEPGRHHFGAPRYRLGCTPQTRGT. 
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FIGURE 9 

RL030 

DNA sequence: (SEQ ID NO: 9) 

CTGATCTGCACGAGATTCGCCGTGAACACTCCACATCCATCCCTTCGCCGAAGCTGCCTGGCCGTCTTGGCCTGCAGTGC 
GCTGGTCGCACAGGGAGCTTTCGCAGCGAGCGCCTCCGAGCAGGCGAACCTGGAGGTGATGATCCGGCAGCTCAACGCCC 
TCGAGGACACCGCCCGCCGCAGTGCCCAGGGCGCCGATGAGCCCGGACAGCGCTTCTACTTCGACTACCCGCGCCTGGCC 
GCTGACCTGCAGCGCATCCGCCAAGGCCTGCAGGACTACATGACGCCCAGCCGCGCCCAACCGCGTGACCCTTCCGACTT 
ATCAGGGAATTACACCCTGCGCGGAGGGCCGATGCCATGA 

Protein sequence (SEQ ID NO: 133) 

LI CTRFA VNT PH PS LR R $ CLAVLACS ALVAQGAFAASASEQAN LEv *u RQLNALEDTARRSAQGADEPGQRFYFDYPRIA 
ADLQR1RQGLQDYMTPSRAQPRDPSDLSGNYTLRGGPMP 
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FIGURE 10 

RL031 

DNA sequences (SEQ ID NO: 10) 

ATGAGCATAAAACAGCCCTTCGAATACCATGTCGAGAACATCGTCATTCCCTACAAAACCCTCACCAAGGGCGTCGCGAT 
GTTCAAACACAAAGAAGACACCTTGGAACCCGACGACCACGCCTTGCTCAACCCTCTGCGCTGGGCCGAGGTCGTGCGTC 
TGGGCCAGGAAGGCTGGGAGCTGGTGAGCGTTCAGCCACTCATGCGGGGCGTAACCGAGATCGGTAATCAAAACGCCCAA 
GGCTGGGCTTGGGGCGTCGCTCTGCCCGTCAGCTACCTGCTGTTTTTCAAGCGCGCAACCTCATAA 

Protein sequence: (SEQ ID NO: 134) 

MSIKQPFEYHVENIVIPYKTLTKGVAMFKHKEDTLEPDDHALLNPLRWAEV\^LGQEGWELVSVQPLMRGVTEIGNQNAQ 
GWAWGVALPVSYLLFFKRATS . 
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FIGURE 11 
RL032 

DNA sequence: (SEQ ED NO: 1 1) 

ATGCTTAGAAACATCTCTATTGGAGTTTTGCTAGCCATGGCTGCTATGTTGGGCAGTTATGGGGTGGCTGCCGCTACATT 
ACGATGCGGGTCGGCAATTGTTAGTGAGGGCGACTTGATTGATGATGTGCTTAGAAAGTGCGGCAACCCTGATAGCCGTA 
AAATTGAAGGGCCCGCAGTGGATGGTAGTGGCTATATAGTGCGGGGGGCTGCTACTGTCGAAAACTGGGTATATGGACCA 
AGGAATGGATGGTACCAGAAGCTTAGGTTTGTCGATGGAAGACTAGTTCAGATAAAAGGCAGTATGGACTAG 

Protein sequence: (SEQ ID NO: 135) 

MLRNI S I G v^LAtiAAKIjGS YG V AAATLRCGS AI V$ EGDLI DD VLRKCGNPDSRKI EG P AVDG SG Y I VRG AATVENWVYGP 
RNGWYQKLRFVDGRLVQI KGSMD 
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FIGURE 12 
RL033 

DNA sequence: (SEQ ID NO: 12) 

ATGAAACTTATCCTTGATTTCGACGGACGCCTTCTAAATCCAAGCAACATGCTAGAGGCCCTATCAAAAGCAGGAAAAAA 
TACAAGC^TCAGCATAAGCAACGCGCAAGCATTAAATATAGAAACTCTTCTCAAGGCAACAACCACTGCAGAAAACACAA 
AAAACCTCTCAACAACTTTCAACGGCGCAGAGCTGACTGCTAACAACCTTCAGCAAGTCATAAACTCAGCAGGATCACTA 
ACCAGAGTATCCACAATAGCCGCACAAGCCATTAATATAAACACACTTCTTTCCGCAATATCTACAGCAGGCAACTCAAA 
GAATTTTAGCGCAGAATTCAATGGAGCCCAACTCAGCAGCGACAACCTACTTAGAGCAGTAAATGCGGCAGGAACAAACA 
CCAGCATAAGCGTCAATACCGCACAAGCGGCAAATATAACCGCCCTTCTTCAAACTATTCATGCAGCAGGTGACACAAAA 
ACATTCAGCGCAGAGTTCAATGGCGCTCAACTTACTTCAAACAACATTCAACAAGCTTTAGACGCCGCAGGAACCCGAAC 
ATCCATTaGCGTCAATACCGCA 

AGTTTAGCGCCGACTTCAATGGTGCACAACTAACAGCAGACAACCTTCAGCAAGCGATCAGCGCTGCGGCCTCGGGTACC 
AATATCAGCGTCAACACCGCTCAGGCGGCGAATATATCCACCCTTTTACAGGCCATCAACATCGCGGGCAACACTAAAAA 
ATTCAGCGCCAACTTTAATGGTGCCCAACTCACTTCAAACAACATCCAGCAGGCGCTCCGAGCGACAGGATCAAACACAT 
CAATCAGCATGAACTCCGCACAATCCGCCAACCAAAGCACTCTACTTGAACTTCTAGACATAGCAAGTTCCAGCAAGCAA 
TTCCAAGCCAATTACAACGGTGGCATGTCTAATCCGAACAACCTACAACAGATAGTTTTCCCGTGCAGGCGCCAGTACAA 
CCGTGTTTATTTCCGACGCACAAGGCCTACCAATCGCAAATATCCTTACCCTTATATCATCTGCCGGATGAGAeTTATAG 
CCGTGGATGAAAACACACCATCCACGGCTATACCCTAG 

Protein sequence: (SEQ ID NO: 136) 

MKLILDFDGRLLNPSNMLEALSKAGKNTSI SI SNAQALNI ETLLKATTTAENTKNLSTTFNGAELTANNLQQVINSAGSL 
TRVSTIAAQAININTLLSAISTAGNSKNFSAEFNGAQLSSDNLLRAVNAAGTNTSISVNTAQAANITALLQTIHAAGDTK 
T F S AE FNG AQLTS NN I QQALD AAG TRTS I SVNTAQAVNI STLLALINSAKDTKKFSADFNGAQLTADNLQQAI SAAASGT 
N I S VNTACAAN I S TLLCA INI AGNTKKF £ ANFNG AQLTSNN I QQALRATG SNT S I SMNS AQ S ANQ S TLLELLD I A S S S KQ 

FQ ANYNGGMSN PNNLQQ I VF P C RRQ YNR VY FRRTRPTNRKY P Y P YI I CRMRLI AVDENT P STAI P . 
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FIGURE 13 

RL034 

DNA sequence: (SEQ ID NO: 13) 

GTGCAGTGGACTCACGAACAGTCACCGATCATCCAGTCGAAGGCACCGAAGATCCTGGTGCGAGCCTTCGCAGGCACTGG 

CAAAACTACCACCCTGGTGGGCTTTGCCAGGTCGAACCCTACCCTGAGAATCCTCTATCTCTGCTACAACAGCTCGGTGG 

AGAAAGCCGCGAAGGGCAAGTTTCCCCGCAACGTAGTGTGCAAGACCGCCCACAGTCTGGCTCATGCGGTGTACGGCATT 

CAGTACGCCCACAAGAAGACGAAGAACCTGCGACTGACCGATATCGCCCGCGGACTCGATACCCAAGACTGGGAGTTGGT 

ACGTGACGTGCTGGCCACGCTGAACAACTACATGGCCAGCGCCGACGCGGAACTCGGCCGACCGCACTTCCCGCGCTTCC 

GCGACAAGGCGTTCCTCACCAGTGCTCAGGAACGCTTCCTCAAGCAGGGCCTGGACATGGCGCGAGTAGTCTGGAGGCGC 

ATGGTCGATCTCCAGGACACCGGCATGCTGATGCCCCTTGACGGCTACCTGAAGCTGTATCAACTGAGCAAGCCCGATTT 

GAGCCAGCGCTTCGACTGCATGCTCCTGGACGAGGGGCAGGACATCAACCCAGTGATCGCGGACATTGCCCATTGGCAGC . 

GCATCAGAATGGCTATCGTCGGCGATCCCCATCAGCAGCTCTACCGGTTCAGGGGCGCAGAAGATGCCCTGAACAGCGAC 

TGGATGGCCGGCGCCGAGGAGCACTACCTGACCCAGAGCTGGCGATTCGGCCCCGCGATCGCACACGTGGCCAACATCAT 

CCTCTCCTACAAGGGCGAAACACGGAAACTTCAAGGACTGGGTCCGCAGACGCTGGTGAAAAAGTCCCTCCCGCCGGACC 

TTCCTCACCGCACTTTCATTCACCGCACCGTTATCGGCGTCATCGAGAATGCCCTGCAGCTGGTCCGCAATCATCCGGAG 

CCCAAATTCCACTGGGTAGGCGGTATCGACAGTTACTCGCTGCGCGACCTGGAGGATCTGTACGCATTCAGCCGAGGCCT 

GCGCCAAAACGTCCAGAACAAGAAACTGCTCCGTGACTACCGCGACTACACCCAGTACGTGGAGATCGCCGAGATCAGCC 

AGGACGGTGAGATGCTTCGCTCGATCAAGATCATATCGACCTACCCTGATCTGCCTGCGCGGATCCTTGAGCTTCGCTCA 

CTGACCCTTGACGATGAGCTGGACGCAACAATCACCCTGACCACCGCACACAAGGCCAAGGGGCTGGAATGGGATTTCGT 

TTGCCTGTACGACGACTTCAACGCGGACCCGCTGGCCCCCGACACCGACCCAGGCAAGCGCGACGATGAGTTGAACCTGA 

TCTACGTCGCAGTGACCCGCGCGATGAAGATCCTTGCCATCAACAGCCTGGTGCTGTCGATCATGCAGCGGTACGTGGAC 

GACAGAAAACTGAAGGAGCAGATAGCTAGCTGTAAAAAATGA 



Protein sequence: (SEQ ID NO: 137) 

VQWTHEQSPIIQSKAPKILVRAFAGTGKTTTLVGFAJ^SNPTLRILyLCYNSSVEKAAKGKFPRNWCKTAHSLAHAVYGI 
QYAHKKTKNLRLTD1ARGLDTQDVJELVRDVLATU , 3^ 

MVDLQDTGMLMPLDGYLKLYQLSKPDLSQRFDCMLLDEGQDINPVIADIAHWQRIRMAIVGDPHQQLyRFRGAEDALNSD 
WMAGAEEHYLTQSWRFGPAI AHVANI I LS YKGETRKLQGLGPQTLVKKSLPPDLPHRTFIHRTVIGVI ENALQLVRNHPE 
PKFHWVGGIDS YSLRDLEDLYAFSRGLRQNVQNKKLLRDYRDYTQYVE I AE I SQDGEMLRSIKI I STYPDLPARI LELRS 
LTLDDELDATI TLTTAHKAKGLEWDFVC^^ 
DRKLKEQI ASCKK . 
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FIGURE 14 

RL035 

DNA sequence: (SEQ ID NO: 14) 

ATGTTCGGGTCGCTGATCGGCGCAATCATCGTGGAGTGGGTATGCCTGTATTTCTTCTGGCCTGACGCGGGCTGGAAGCA 
TGCCCAGGCCATGTTTGAGTACGAACTCAGTTGGCTGTCGCAGGGGCTGCTACACAGCGTCGTCGTGCAGGAGCCAGGTC 
GAACCGCCACCTGGCTGGCCCAGTTGGCCTATGACTGGTTGTTCGTGAAGACCGGGATGGTCGACTGGATGACCAACATG 
ACTACCATCGCGCAGGCCCGGCCACGGAGCCCGCTGGACGTTCGCTATCTCACCGCCCACGGTGTCTCCACGCTGCAGAA 
CTACGGCCTGGCCGCGCTGTACACGGTGCTGACATTCGTCGTGCGCCTGGTGATCCTGGTCATGACGATCCCGTTATTCG 
TGATGGCCGCGTTCACCGGCCTGGTGGACGGCCTGGTGCGCCGGGACCTGCGCAAGTTCGGCGCCGGCCGGGAGTCCAGC 
TACCTCTACCACAAGGCGCGCGGCAGCATCATTCCGCTAGCGGTCGTCCCTTGGACGCTCTACCTGGCAATCCCCATCAG 
CATCAATCCCCTGCTCATCCTGTTGCCCTGCGCCGCGCTGCTCGGCGTAGCGGTATGCATCACAGCATCCACCTTCAAAA 
AG TAC CTG TAG 

Protein sequence: (SEQ ID NO: 138) 

■ MFGSIilGAI IVEWVCLYFFWPDAGWKHAQAMFE YELSWLS QGLLHS VWQEPGRTATWLAQLAYDWLFyKTGMWWMTliW 
— TTIAQARPRSPI^WYfcTAHGVST^ 
- YLYHKARGSI I PLAWPWTLYLAIP I S INPLLI LLPCAALLG VAVCI TASTFKKYL . 
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FIGURE 15 
RL036 

DNA sequence: (SEQ ID NO: 15) 

ATGAAGTTGAAGAATTTCTTACAGCCTTTTGATAGCGGTTTCTCCACTCCGAGTGCTGCGCTCAAGCTGCTCCGCATGCT 
CGGTGGCGCCTTGATGTTGTGCGTGCTATGCAGCCTGATATTCAGTGTGAGCATGGTTTTAAACCATCAGGTGTCCCTCA 
GTCGGCAAGCTATGAATGTGGCTATGTACGAAGCGCAGCTTTATTTCGAGCAGCGCGAGGCGTTGCTCAATCACTTGAGC 
GGCAATGTCGTGCCCTTGGCCGCGGGTAGAGCGCTCGTCAACGAAGCGCCGAACAATGTGAGCATCCTGCCGTTGAGTGA 
CGG AG GGCG AGG TCTG CTATTG ACCGCTCGC ACG CTCGGTG ATCTCCGGG AAAAGCGGCTGG CACTG ATGTATCTGGTCG 
ATACCGACAAAGGCCCTCTGGTTTACCGGCTTACCGCCGATGGTAGGCCCTCGGCAGCGATATCCAGCACGATAACCAAA 
GAGGTGTACCGAGCCTTGCTGGCGACTCCGTCGGCGCCTGTTCACTGGGTGACTGACGGTGGTACCCCTCAACGGCTGTA 
CCTTTTTG AATCCTT AGGCG ATG AG C CGGGCG AGGGGTGG CTAGG C CTGG AG ATTCTCGG CG AAG ACCTCG ATTCGATGT 
TG CGC CGG AATG ATGCCGG AAACTAC ATG CTG CTGG ATC AGC ATGGGCAGGTCGT ACTCGCTACGG ACG C AG AGG CGCTG 
GGGAGCGGTGCGTCGCGGACGCTTTTGCGTGGAGACGGCTTCGGTTTCATCGGTGCTGGCCCACTGCCGCAGCATATGGT 
GCTTTTCCAGCACGTGGGGTCTTCGAGCTGGGATCTGATCTATCACATCGGTATCGGTCGCCTGTTGCTGGCTCTGTGGC 
TCCCTCTGTTACTTGCCTCTGCGTTGGCACTCGCAGTCGGCATCCTACTGCATTGGCTGGTGCGGAGCATCGAGCGACGC 
J£XG ATAG AG C CCGC AAA'^G ACGCCTTG AAG C ATTG AAGG AG AGCG AAGCCTTTTCCCGTG CAGTTATCCAGGCCGCGCC 
CGTCGCGCrGTGCGTCCTGCGTC^ 

GCGAGGCGATTGCCCACGACGCGCCGAGATGGATTTCCCAGGCGTTCGCAGGAGGTGTGAAGTGTTCTGGAGAAGAACTG 
GAAACCGAGGCAGGGCTACATCTTCATCTCAATTACACGCCCACCCGCTATAACGGTGAAGACGTATTGTTCTGCGCCTT 
CAGTGAAATCAGTGCACGCAAGCGGATGGAGGCGGAACTGGCTCGCGCAAAATCCCTGGCGGATGCTGCCAATGAAGCCA 
AGACGCTGTTTCTCGCCACCATGAGCCATGAAATCCGCACACCTCTGTACGGCATGCTTGGCACGCTTGAGCTGCTTGGG 
CGTACCGAGCTGAGTCGGCAGCAGGCCGGTTACCTAAAGGCAATCCAGCATTCCTCGTCGACCCTGCTGCAACTGATCAG 
CGATGTGCTTGACGTATCCAAGATAGAGGCCGGCCAACTGGACCTAGAGTGCGTGGAATTCTCCCCGCTGGAATTGACCG 
AAGAGGTCGTGCAGTCGTTCACCGGTGCCGCGCAGGCCAAGGGGCTGCAGTTGTATACCTGCCTCTCTGCGGAGCTGCCG 
CTGCGCATGCGGGGGGCCGCGGCGTCGATCCGGCAGATTCTCAACAACCTGCTGAGCAACGCGGTGAAGTTCACCGACAA 
TGGCT ATGT C AACGTCCAC CTG AAG G CC AG CGTGGT CG ATGCCG AATG TGTGATGCTG ACCTGGCAGGTCAACGATACCG 
GCATGGGGATCAACGTCGAGGATCAGCCGCGTCTGTTCGAACCGTTCTACCAGATACGCCGCTCCGAGCATCCGGTCGCA 
GGCACGGGCCTCGGCTTGTCGATCAGCCAGCGCCTGGCGCAGCTAATGAATGGCAGTCTGAAACTGGTCAGTGAGCTGGG 
GTTGGGCAGCAGCTTTAGCCTCAGGCTTCCGCTTGAGCGGATCGCGATGCAGGCTGAGCCGCAGGACCTAGCCGGGTGCG 
CCGTCCAAGTGCTGGCGCCTGTCCGCGACCTAACGGAATGCCTGTGTGGCTGGATCTCCCGCTGGGGTGGAAGGGCCATG 
GTCGCGACGCCGAGGTCGCTGGACGAGGCGGACGCGACCTCGCTGCTGGTCAAAGTGTTACTGCTGGAGGGGGCGCCGAT 
GTTCGAAGCATGGCCAGGATGCCGGGTGGAGCTTTCCCCTCAGGGTGATATGGAGCCGCAGGCACAGGGCCGCGACTGGC 
TGCTCGGGCTCAACAACCTGAACGGCCTGCATCGTGCTCTGGGCCTGGCCCATGGGCGTCTCGCTGATCCTTCGACGCCG 
CCGATACGGCTGGCTCCGTTGCGCAATCTAGGTCTCCGCGTCCTAGTGGTGGAGGATAACGCGATCAACCAGTTGATCTT 
G AGGG AC C AG ATGG AAGCGCTGGG CTG CAG CGTGG AG CTGCTCTTCG ATGGTCG CG AGGCGTTGCTG CACTGC CAGACGG 
CCTGCTTCGACGTGGTGCTCACCGATATCAACATGCCGAACATGAACGGATACGAGCTAACCGCGGAGCTACGGCGCCAA 
GGGTTCCGGCAGCCGATCATCGGCGCGACGGTGAACGCCATGCGTGAGGAGCGCGAGCGCTGCATGTCCGCCGGGATGAA 
CGATTGCCTGGTCAAACCGGTGGATCTGAATGCCCTTCAGAACTGCTTGATTAATATTCTCAAGGTGGATCGATGA 



Protein sequence: (SEQ ID NO: 139) 

MKLKNFLQPFDSGFSTPSAALKLLRMLGGALMLCVLC 

GNWPIJ^GRALWEAPNNVSILPLSDGGRGLLLTART1X3DLREKRIALMYLVDTDKGPLVYRLTAI)GRPSAAI 
EWRALLATPSAPVHVATTDGGTPQRLYLFESLGDEPGEGWIXSLEILGEDLDSMLRRNDAGNYMLL^ 

GSGASRTLLRGDGFGFIGAGPLPQHIWLFQHVGSSSWDLIYHIGIGRLLLALWLPLLLASALA1AVGILLHWLVRSIERR 
LIEPAKRRLEALKESEAFSRAVIQAAPVALCVLRRADAAWLENPQARQWLGDSEAIAHDAPRWISQAFAGGVKCSGEEL 
E T E AG LHLH LNYT PTR YNG ED VL F C AF S E I S AR KRME AE LARAKS LAD AAN E AKTL FLATM S HE I RT PL YGMLGTLELLG 
RTELSRQQAGYLKAIQHSSSTLLQLISDVLDVSKIEAGQLDLECVEFSPLELTEEWQSFTGAAQAKGLQLYTCLSAELP 
LRMRGAAAS I RQI LNNLLSNAVKFTDNG YVNVHLKAS WDAECVMLTWQVNDTGMG INVEDQPRLFE P FYQ IRRSEHPVA 
GTGLGLSISQRLAQIJWGSLKLVSELGIXSSSFSLRLPLERIAMQAEPQDLAGCAVQVLAPVR^ 

VAT P R S LD E ADATS LLVKVLLLEG AP M FE AW PG CR VE LS PQGDME PQ AQG RDWLLG LNN LNG LHRALG LAHG RLAD PST P 
PIRLAPLRNLGLRVLWEDNAINQLILRDQMEALGCSVELLFDGREALLHCQTACFDWLTDINMPNMNGYELTAELRRQ 
G FRQ P 1 1 G AT VN AMREE R E RCMS AGMNDC L VKP VDLNALQNCL I N I LKVDR . 
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FIGURE 16 
RL037 

DNA sequence: (SEQ ED NO: 16) 

ATGAG CTGG AAATC CTATCGGGTGCTGG TGGT CG AAG ATC AGCCG TTTCAG CGCG AAT ACCTGCTCAACCTGTTTCGCG A 
GCGCGGCGTGCAGTACCTGGTAGGTGCCGGCGACGGCGCGGAGGCGTTGCGCTGCCTGAAGCAGGACAGGTTCGACCTGA 
TCCTCAGCGATCTGATGATGCCGGGCATGGATGGTATCCAAATGATCCTGCAACTGCCGTATCTCAAGCATCGTCCGAAG 
CTGGCGCTGATGAGCTCCTCGTCGCAGCGGATGATGCTCAGTGCCAGCCGGGTCGCCCAGAGTCTCGGCTTGTCGGTAAT 
CGACCTGTTGCCCAAGCCGACTCTGCCCAAGGCCATCGGCCAACTTCTGGAACACCTGGAAAGATGCCTCAGGCAGAAGC 
TGGAGCCGGAAACCGACGAGACTCCGCATGGGCGCACGGCGTTGCTGGATGCCCTGCATAACGAGCAACTGGTGACCTGG 
TTCCAGGCTAAGAAATCCCTCCACACCGGGCGCATAGTCGGCGCCGAGGCGTTGATACGCTGGAGCCACCCGCAGCATGG 
CCTGTTGCTGCCCAGCTGTTTCATGAGTGATGTCGACGCTACCGGTCTGCACGAGGCGTTGCTCTGGCGCGTGCTCGAAC 
AGACCCTGAACGCCCAGGAATCGTGGCGCAGGGCGGGTTACGAGATTCCGGTTTCGGTGAATCTGCCGCCGCACCTGCTC 
GATAACCAGGAACTTCCGGATCGACTCTATGAGTACGTCGGCGCTCGCGGGGCTTGTACCAGCTCACTATGTTTCGAGTT 
GACCGAGAGCAGTGTCACAACTCTGTCAAGTAACTACTATGCAGGTGCCTGTCGCTTGCGCATGAAAGGGTTCGGATTGG 
CCCAGGACGACTTTGGCCAGGGTTACAGCTCGTTCTATAACCTGGTCACGACGCCTTTCACGGAGCTGAAGATCGACCGC 
TCCCTAGTCCAGGGATGCGTAGAGGATAACGGCCTCAATGCAGCTGTCATCAGTTGTATTGAGTTGGGTCACCGCCTG^ 

^ JTCTCG ACG TGGTG.G CCG AAGG CGTGGAG ACCTGCG A 

lGTXXCCtGAT.TTCTAAGGCAG 



Protein sequence: (SEQ ID NO: 140) 

MSVJKS YRVLWEDQPFQREYLLNLFRERGVQYLVGAGDG AEALRCLKQDRFDLI LSDLMMPGMDGI QMI LQLPYLKHRPK 
LALMSSSSQRMMLSASRVAQSIX3LSVIDLLPKPTLPKAIGQLLEHLERCLRQKLEPETDETPHGRTALLDALHN 
FQAKKSLHTGRIVGAEALIRWSHPQHGLLLPSCFMSDVDATGLHEALLWRVLEQTLNAQESWRRAGYEIPVSVNLPPHLL 
DNQELPDRLYEYVGARGACTSSLCFELTESSVTTLSSNYYAGACRLRMKGFGLAQDDFGQGYSSFYNLVTTPFTELKIDR 
SLVQGCVEDNGLNAAVI SCI ELGHRLNLDWAEGVETCEELNLLRRLGCDRAQGFLI SKAVSAREFERQLREDGPSLLV . 
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FIGURE 17 



RL038 

DNA sequence: (SEQ ID NO: 17) 

GTGAAGTCTGCTAGTGCCTTGGAGCACGACAACAAACTTTTGCTCAAATGGACAACCCTCTCGCAGAGCCTGAGCATCGG 
CTTGATCTGTGTGGTGGTGCTGACCGTATTGCTGTTCAGCATCTGTTACTGGTCGCTGGGGAGATTGTTTCAGGAGGAGG 
AGGACAAAGTCTCCTTCCACTTCACCCGTATGATGGATGTTATACGGGAGCATGAGGTATTTCTTGGGCGCATCGCTGGA 
AAAAGCGACAAGACCACCCAGAAGTACGACTATGACGTGGTGCCTTTGCAGCGGCACTTGTTGGCAAAGGAAAACGGATT 
AGCGGTCTATGAGGGACGGGAGTTTTCCTTTGCTATGCCATTTCTACTGGCTACCAAGCACGCGTTGAGCGCCGATTCCT 
CGGGAGATCCGTTTTCGCTCGGTGTATTGCTCGCCAATTTCTACGGAAGCTTCTGGAGTGTTTCCGCCTATCCCGCGCCA 
CAGTTACTGATCTTTGATCTTTCCGGCAGCACCCGCCTGGCAGTGCCGTCGATTCCCTCCACAGCGCAGCGTGACAGGTT 
GAGCGGAAGCTATCCGATGATAGTCGAGCGCATTCTGGCGCGCTTGCGCACCCGGCCGGTGGGGGAGGACGCTCAGCGTG 
TCCATTGGATACGCGCTGATCGCTATCGCGACTCGGCGCTGGAGATGTTGGGAGTCGCCCGGGTTGATCTGCCGGAAACA 
CTCTGGTGGCACGACGAGCCGAACCATCTGATCATCGCTGCGAGCCTGCTTGATCTCAGGCGAATCAATGACTTCGAACA 
GTTGGTTGAGCGCCCGGCATTCGATTCGTACAGCCTGGTATCGCCGGATGGCGAGGTATTGCTCGGCGCGGCCCCTGCGA 
CCGGCCTGAGGGATGGCCTGAACCTCACCCGACAGGGGGTCGCCGTTCAACTGCTCAGCCAGCCTGAGAACGGTTGGCTC 
G CGGTCT ACCGAACCG A CTACGG CAATTTCTTTCG CC ACTC C CGGTGG CTGG TGG C AGGTCTGCTGCTGACCCCGGCGCT 
-^GGTGGTGGCGGGT-TGGGTGGGGATGGGTTGGTAGAGGAGGAGGGTGGTGAACGCGGTGGATGGGGCGCACCGGCAACTGG 
'TGGAGAGCGACACCTTCAGCCGGACGCTGATA^ 

CTGGTGACCTGCAACCACTTGGCCGCCCAGTGGCTGGGCGGGCCCACGGAGATCCTTGGGCTGACTTCCAACTGGAAGCT 
TTTCGATGCGCGTGGGCAGGTACCAGGAGACATCTGTATCCAGGTCGGTGGGCGCTATTTGCAGACCGCCTTCGCGGCGA 
CCCGCTATGCCGGCACCGAGGCGGTACTGTGCGTATTCAACGACATCACGGTCCACTGCGAGGCGGAGACCGCGCTGTCC 
AATG CG AAG CG AGCAG CGG ATG CCGCC AGCCAGG CCAAG ACC CTGTTC CTGG CCCGCATG AGCCATG AAATCCGTACTCC 
CCTGTACGGTGTCCTTGGCACCCTGGAGTTGCTCGACCTGACCACCCTGAACGAGCGGCAACGCGCCTACCTACGCACCA 
TCCAGAGTTCGTCTGCGACGCTCATGCAACTGATTAGCGATGTGCTGGATGTCTCGAAGATCGAAGCGGGGCAGATGGCT 
CTGACCCTGGCCGCCTTCAATCCGCTGGACCTAGTGCGGGAAGTGCTTGGCAACTTTGCCGCCAGCGCCATGGCCAAGGA 
CCTGCAGTTCTATGCCTGCATCGACACCGAAGTGCCGGCGCAACTGATCGGTGACGTGACGCGGATTCGCCAGGTGCTCA 
ATAACTTGGTGAATAACGCGCTGAAGTTCACCGATATCGGACGGGTGGTCCTGCGCGTGAAGTTGCTCTCCCGCAATGAT 
GGTCGAGCCCTGTTGCAGTGGCAGGTCGCCGACACCGGTATCGGTATCGCACACGAACAGCAGGAGCGCTTGTTCGAGGC 
GTTCTACCAGGTTTCGGGAGCGCACCATGCCGGCGGCACGGGGCTAGGACTGTCGATCTGCTGGCATCTGGCGGAAATGA 
TGGGCGGTCACCTGCGAATGGTCAGCGAGACAGGGCTCGGCAGCAGCTTCAGCCTGGTGCTCGAGTTGCCCGAGGACGAA 
CAGTCCGGGCTGGCTTGCCGGCCGGGGCTCTTGAAATCCGCTTGCGTCCATGTGCGCTCGCCCGTGCGGGAGCTAGCCGA 
CAGCGTAGGGGCGTGGCTGAAAGCCTGGGGCTGCAAGGTCAGCAGCGGCGAGGCGGCGCCCTCCGAGCTGGAGACTTGTG 
TGCTTCTGGAGCTGCTGCCGATGGCGGCCGGGCCTGCTTCTTCGCCCTGGCCAGGCCCCCGGGTGCGCGCGTCCATGGAT 
GCGCCTTGCCAGCCGGAGCTGCGTGAGGACGGCTGGCGTGTCGGCCTGCACAACCTGGCGGGAATCGGCCAGGCCCTGGC 
GCAGGCTCTGGGTGGCGATATCCCCGAGCAAACGCCGGCAAATGCCTGCGCCCGCTCGGGGAGACTCGACCTGGAAGTGC 
TGGTCGCCGAGGACAACCCAGTCAACCAGGCGCTGCTTCGCGAGCAACTGGAAGAGCTGGGTTGTCGCGTGAGCCTTGCC 
GGCG ATGGGCGG C AG GCCCTG CAGCTGTT CG AC AGTGGTCG CTTCG AC CTCCTGCTCAG CGACGTCAACATGCCG AACAT 
GACCGGCTACGAACTGACCCAGGCGCTGCGCGAACGAGGCGAGACGCTGCCGATCATCGGCGTGACCGCCAACGCCCTGC 
G AG AAG AGGGCG AG CGCTG C CGGGCAGTGGG AATG AAC AG TTGGCTGG TG AAGC CGATCACTCTGCAT AC CTTG CATGAA 
CTGCTCAGTGAGTTCGCTCGCGCAGGTGTCGTGCTTCCCGCGCAAGCGCGAGACCTCGGCCCGCCCGCGCAGCTCGACGA 
CGGTCTCTCACCGCAGGTGCCGGAACGCATGCGCGCGCTTTTCCTTGAGACCATGGGCAAGGACCTGGAGGCCGCCCGGC 
AAGCGATTCGCCGCAACGACCCGAAGGGGCTGCAGCAGGACCTGCATCGCATGGCCGGCTCCCTGGCGGTGATGCGTGCG 
CGAACGCTGGTGGTGATGTGTCAGGGCGCCGAGGAAGGCCTGCTGGAGTCGCGCCTTGAATGTTCCGCCGTGGAGATTGG 
CGAGGTGCTCGTTCATATCGAGCAGGCGCTGGAGTTTGTGAGAAAGACGGGCTGA 



Protein sequence: (SEQ ID NO: 141) 



VKSASALEHDNKLLLKVITTLSQSLSIGLICVWLTVLLFSlCYWSl^RLFQEEEDKVSFHFTRMl^VIREHEVFLGRIAR 

KSDKTTQKYDYDWPLQRHLLAKENGLAVYEGREFSFAMPFLIATKHALSADSSGDPFSLGVLIJ^FYGSFWSVSAYPAP 

QLLIFDLSGSTRIAVPSIPSTAQRDRLSGSYPMIVERILARLRTRPVGEDAQRVHWIRADRYRDSALEMLGVARVDLPET 

LWWHDEPNHLIIAASLLDLRRINDFEQLVERPAFDSYSLVSPDGEVLLGAAPATGLRDGLNLTRQGVAVQLLSQPENGWL 

AVYRTDYGNFFRHSRWLVAGLLLTPALLLAGWLGMRWYTSSWNPVHRAHRQLVESDTFSRTLIQTAPVALWLTQDDQQ 

LVTCNHLAAQWLGGPTEIIXSLTSNWKLFDARGQVPGDICI^^ 

NAKRAADAASQAKTLFIJU^MSHEIRTPLYGVLGTLELLDLTTL 

LTLAA FN PLDL VR E VLGN FAA S AMAKDLQ F YAC I DT EVP AQL I GD VTR I RQ VLNNLVNNALKFTD I GR WLR VKLLS RND 
GRALLQWQVADTGIGIAHEOQERLFEAFYQVSGAHHAGGTGl^LSICWHIAEMMGGHLRMVSETGLGSSFSLVLELPEDE 
QSGLACRPGLLKSACVHVRSPVRELADSVGAWLKAWGCKVSSGEAAPSEL^^ 

APCQPELREDGWRVGLHNLAGIGQALAQALGGDIPEQTPANACARSGRLDLEVLVAEDNPVNQALLREQLEELGCRVSLA 
GDGRQALQLFDSGRFDLLLSDVNMPNMTGYELTQALRERGETLPIIGVTANALREEGERCRAVGMNSWLVKPITLHTLHE 
LLSEFARAGWLPAQARDLGPPAQLDDGLSPQVPERMRALFLETMGKDLEAARQAIRRNDP 
RTLWMC QG AE EGLLE S RLE C SAVE I GEVLVH I EQALEFVRKTG . 
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FIGURE 18 

RL039 

DNA sequence: (SEQ ID NO: 1 8) 

ATGCGTCCGGGGTCAATAGTTGGAATTAGAACACAAGAGAAGCCTATGAGTAAGCTCAAGATAGTACTGGCCGATGACCA 
TCCGATCGTGCGTATGGGCGTATGCGACATGCTCGAGCGCGACGGTCGGTTCGAGGTGGTGGGCGAGGCCTCCACGCCCA 
GCGAACTGGTCGAGGTGTGCCGGCAGAGCGAGCCGCATATCGCCATTACCGACTACAGCATGCCCGGGGACGAGCGCTAC 
GGCGATGGACTGAAACTGATCGACTACCTGTTGCGCAACTTTCCTCGTACTAAGGTGCTCATCTTCACCATGGTCGGCAA 
CCGCCTGATCCTCGACAGCCTCTACGATCACGGGGTGTCCGGCGTGGTGCTGAAGAGCGGCGAACTCGACGAGCTGCTCT 
TGGCGCTCGACGTGGTGAAGCAGAACCGCGTCTACCGGGGCGCGAACATGCTCGACCCGACCAGTGTTCTGGCGAACCGC 
GACGAAGTGGAAAGCCGCTTCGCGCGCTTGTCGATGAAGGAGTTCGAAGTACTCCGTCACTTCGTTTCCGGCAGCAACGT 
CTGCGATATCGCACGGCTGCTGAAACGTAGCGTCAAGACCGTA_AGCACGCAGAJiGGTCTCGGCGATGCGCAAGCTGGAAG 
TGAACAGCGACCAGGCCTTGATGACCTTCTGCGTGCATGCCAACTTGTTCCATTGA 

Protein sequence: (SEQ ID NO: 142) 

MRPGSIVGIRTQEKPMSKLKIVLADDHPIVRMGVCDMLERDGRFEWGEASTPSELVEVCRQSEPHIAITDYSMPGDERY 
GDG LKLI DY LLRNF PRTKVIil FTMVGNRLI IJ3S LYDHGVSGV^KSGELDELIjLALDVVKQNRVYRG ANMLDPT S VLANR 
DE VE S R F ARL S MKE FE VLRH FVSG SNVCD I ARLLKRS VKTVS TQICVSAMRKLEW SDQALMTFC VHANLFH . 



Figure 1 8 



1 



Title: VIRULENCE-ASSOCIATED NUCLEIC ACIDS AND 
PROTEINS AND USES THEREOF 
Applicants: Laurence Rahme et al. 

Filing Date: September 12, 2003 Serial No.: Not Yet Assigned 
Page 35 of 118 Customer No.: 21559 

FIGURE 19 

RL040 

DNA sequence: (SEQ ID NO: 19) 

GTGTCCAGTAAGATCCTGCTGCAAGGGGCACTGCTCGGCCTAGCAATGCTGGCCGTGCTGGACGCCCGAGCCGGAGTCAC 
CGCCGAGCGCACTCGGGCAATAATCGCCGAGGGGCACCGCGAGACGTCGCTGCTGCTGGTCAACCAGAATGCCTATCCGG 
TCATAGTGCAGACCTGGATCGACGATGGCGCCCCGAACTCGACACCGCAGTCTGCCCGCGCGCCGATCATGCCGCTACCG 
CCGGTGTTCCGCCTCGAACCCGGACAGCAACGCAGCCTGCGCCTGCTGCGGACCGGCCAGGCGCTGCCAGGGGACCGCGA 
ATCGCTGTACTGGTTGAACCTCTACGAAATCCCGCCGCAAGCCACCGGGCTGCTGGCCGAAGGACAGTCACGGCTGACCG 
TTACACTGCGCACCCAGATGAAAGTCATCTACCGCCCTCGCCCTCTGGCCAGAGGTGCGGAAGAAGCGCCACACCAGCTC 
AGGTTCGAGCGGCGGGGCGAAACACTACAGATGGAGAACCCTACTCCCTATTTCATCAGCCTCGCCGGCGCCGAGCTTGG 
CGGCCACACCCGCCTGGCGGCGGCCGAACTGTTGCCCCCCTTCTCCAGGCGCGTCCTGGCGCTCCGCCAGGCGCTGCCCG 
G CGGCCAGG C CG AGGTGCGCTTCAG CTGG ATCG ATG ACGG CGG C AATCTC C AGC AGGG ACGG AGCCTG CTTC ACTG A 

Protein sequence: (SEQ ID NO: 143) 



VS SKI LLQGALLGLAMLAVLDARAGVTAERTRAI IAEGHRETSLLLVNQNAYPVI VQTVilDDGAPNSTPQSARAPIMPLP 
PVFRLEPGQQRSLRLLRTGQALPGDRESLYWLNLYEIPPQATGLIAEGQSRLTWLRTQMKVIYRPRPLARGAEEAPHQL 
RFERRGETLQMENPTPYFISLAGAELGGHTRLAAAELLPPFSRRVLALRQALPGGQAEVRFSWIDDGGNLQQGRSLLH 
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FIGURE 20 
RL041 

DNA sequence: (SEQ ED NO: 20) 

ATGAAAACATCCCTGCGCGTCCTGCCTCTGCTCCTCGCGCTGCTCGCCTCGTCTAGCTGGGCGACCTGCTACAAGGTCAC 
GGCGGTAGGCAACGCCACGACTACCTCCAACACCCAGATACGTCCCGGTGAAGGCTCTGCCGGCACCTGGGCCGGAGCCT 
GCGATACCTGCAACGGTTCCCTCGGTCTACCGAGCGTGATCAACGTCAGCGACGCCAGCTTCCAGCCCGACGGTAGCTTG 
ATCG C CAGCTCGG TGG CGCCG CTC AGC C AATACGGCG ACAG CG C CGG CT ACG ACCC AG AG CGCGTGTTCTTCCGCTG TGC 
TCCAGAGGACGATGTGTACGAGATGTTCTCCACCAATGCCGACGATCTCTACAGCGGCTGGTACCTAGGAGGCGACAGTG 
CGGGCAACTCGATTGGCCTGCAGTCCGCCTATCGCACCGCCTGGCCCAACGTGCTGCTGCGCCTAACCCACGTGGAAACC 
GGGCAGTATTTCACCGATGTCTGGCGCGAGCGTCTGCTCGGCGGGCTCGATATCGACTCGCGAGGCTTTCAACTGGTCAA 
GGCGAAGAACCTCAGCGCGGTACGCGCCGAACTGTTCCGCGCTCCGCTGGAGTTCATCCGCTACTACTCGCCGACTACCG 
CCTCGCGGTTGTACGCCTACACCCAGCCCGCTGGCTACATCGCCATCAAGGGTCCCGGCCTGGCCTACCCCAACGTCGGC 
GCCAGCCATAACGCCAACTACCTCGGCTGGCACTACAACTGGCCGGGCGCCATCGGCCTGTACAACGACGTGACGCTCAA 
GCGCTATCCCACCTGTTCCGTAACCAACGTCACGCCCCACGTTGTGTTCCCGTCGATTTCCCTCAGTGAGATTAATGCCG 
GCGCGAACCGTGAGATGCCCTTCGAGGTGGCCTTCAAGTGCCAAACGGGAGTGATCAACAGCACCGCCTCCAGCGGTACT 
GCACTGGGTATGAGGGCT-TCAGCCGGGGCGCAGGCCGGGTCCGCTGGAGTGGGGCTGAGGAACGGCAATGGCGGGGTCTC 
~~GTACCTAGTTTCGGACGGGTAGGGCGAGCCTGGTATGGCGCAAGGGGTGGGTATCGGCTTGGTGGGG<3AGGGCAGTGCGA 
TGAACCTGCTGGTAAGCGAGGATTCCGCGATGGGCAGCAATGCCGAAACACGGGGCTGGTATCCAGTGATCGGCAACGCC 
TCGAACAAGACTGGCGAAGCGGGAGGCATCAGCCAGTACAGCGAGACCTTCCGTGCGCGCCTGGAAAAACTCACCGTTGG 
CAGCATGCCCAGCGTTACCCCGGGACGGGTGGAGGCCAGCGCGCAGGTAGTGATTCGTGTCCAGTAA 



Protein sequence: (SEQ ID NO: 144) 
• 

MKTSLRVLPLLLALLASSSWATCYKVTAVGNATTTSNTQIRPGEGSAGTWAGACDTCNGSLGLPSVINVSDASFQPDGSL 
IASSVAPLSQYGDSAGYDPERVFFRCAPEDDVYEMFSTNADDLYSGWYLGGDSAGNSIGLQSAYRTAWPNVLLRLTHVET 
GQYFTDVWRERLLGGLDIDSRGFQLVKAKNLSAVRAELFRAPLEFIRYYSPTTASRLYAYTQPAGYIAIKGPGLAYPNVG 
ASHNANYLGWHYIWPGAIGLYNDVTLKRYPTCSVTNVTPHVVFPSISLSEINAGANREMPFEVAFKCQTGVINSTASSGT 
ALGIRASAGAQAASAALGLRNANGGLSYLVSDRYGQPGMAQGVGIRLLRDGSAMNLLVSEDSAMGSNAETRGWYPVIGNA 
S NKTG EAGG I S Q YS ET FRARLE KLT VG SMPS VT PGR VEAS AQ WI RVQ 
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FIGURE 21 
RL042 

DNA sequence: (SEQ ID NO: 21) 

ATGTTCTGCCACGTTGAGGCACGGCGCACCGGCAAACTGCCGCTGGCTCTGGGCGGCCTAGCGCTGGCCTTCGCGGGCCT 
GGCCAATGGAGAGGCGCAGTATCGTTTCGACGACAGCCTGTTGATGGGCTCGGGCCTCGCCGGCGGGACCCTAGAACGCT 
TCAACCGGGCCAACCAGGTGGACCCCGGAACCTACCATGTCGATGTCTATCTCAACGGCAGCTACGCCAGTCGCACCAGG 
ATCGAGTTCCGCCCCCGGGCCGGCGGCGTCAAACCCTGCTTCGGCGAACGCTTCTTGCGCCGGACGCTGGGCGTCCGCCC 
CGCCTCTGAGGCCGGCGTGCAAGCGCCTGGAGATTGCCTGGGGCTGGAAGAACGCCTGCCAGGCTCGACCTTCAATCTCG 
ACACCGCCCTTCTGCGCCTCGATCTCTCGGTGCCCCAGGCCCTGCTGGATATCAAGCCACGCGGCTACGTGGGTCCCGAC 
GAGTGGGACGCTGGCAGTAGCATGGGCTTCGTCAACTACGACGCCAGCTTCTATCGCTCGAGCTTCGACGGAGTAGGCGG 
CAACGGCGACTCGGACTATGGCTACCTGGGGCTGAGCGGGGGCATCAATTTCGGCCTGTGGCGCCTGCGCCACCAGTCCA 
ACTACAGCTACTCCAGCTATGCGGGAAACACCCGCAGCGACTGGAACAGCATCCGCACCTATGCCCAGCGCGCGGTGCCA 
GGCCTGCGCAGCGAACTGACCCTGGGCGAGAGCTTCACCGAGGGCAATCTGTTCGGCAGCCTGGGTTATCGCGGCGTGCG 
CCTGGCCAGCGACGACCGCATGCTGGCAGACTCGCAACGCCGCTATGCTCCACAGGTACGCGGTACAGCGAACAGCAACG 
CACGGGTGGTCATCAGCCAGAACGGCAAGAAGGTCCACGAATCCGCCGTCGCTCCCGGTCCCTTCGTCATCAACGACCTC 
TATGGCACCGCCTACGACGGCGATCTGGATGTCCAAGTGATTGAGGCCGACGGCAGCGTCTCGCGCTTTTCCGTGCCCTT 
/TTCCG CGG^GGGGAAT C G ATGGG C CCGGGCATCTCGCG CT AC AG CGGGACCCTCGGCC AAGCGCGCC AGTATGG CG ACG 
G CAACG ACCT.GT-TGGG C CAGCGCGGC CTG AC C AACTCG CT7UVCCGCCAACCT.CGGCTCG CGC CTGGCC 

GAGGACTATCTGGCGCTGCTCGGCGGAGGCGTGCTCGCCACGCCCTACGGAGCCTTCGGCTTCAACAGCATCTTTTCCCA 
TGCCACGGTGGAGAACGGCCAGCGCAAGCAGGGCTGGCGTGTCGGTCTGAACTACAGCCGGACCTTCCAGCCGACCCAGA 
CCACTCTCACCCTGGCTGGCTACCGCTATTCCACCGAGGGCTATCGCGACCTCGGCGACGCGCTTTCGGCGCGCCACGCC 
G ATG AG C ACAACG ACTC CTGG AACT CC AG C AG CT ACAAG CAACG C AACC AG TTCACCCTGCTGGTCAAC C AGGG CCTGGG 
GGGCTACGGCAACCTGTATCTGTCCGGAGCCACCAGCGACTACTACGACGGCAAGAGCCGCGACACCCAGTTGCAGTTCG 
GCTACAGCAACACTTGGCGCCAGCTCAGCTACAACCTCGCCTATTCGCGCCAGCAGACCACCTGGTACCGCGATCTGAAC 
G ACG ACT ACG ACCCGTC ACTG C CG C CG C AAT AC AACCTGCG G C ACGG C AG CG AACGT AGC AACACCTT AAC C CTG ACACT 
TTCCATGCCGCTGGGGTCCTCCAGCCAGGCCCCGAATCTCAGCGCGATGGCCTCCCGGCGTTCCGGCGACAGCCGCGGCA 
GCAGCTACCAGACGGGCCTCAACGGCACCCTCGACGAAGACCGCAGCCTGAGCTACGCGATTGCCGCCGGGCGCGACAGC 
GACAACCACGGCAGCGATTTCAACGGCAGTCTGCAGAAACAGACCTCGGTGGCGACGCTGAACGCCGGCTATGCCGAGAA 
CAGCAGCTACCGGCAGCTCAACACCGGCCTGCGCGGCGCCGCCGTGCTGCATCGCGGCGGCCTGACCCTCGGCCCCTACG 
TCGGCGACACTTTCGCCCTGGTCGAGGCCAAGGGCGCCAGCGGAGCTGGCGTACGCGGTGGTCAGGGCGCGCGCGTCAAC 
GGCAATGGCTACGCCGTGGTGCCATCACTCTCGCCCTACCGCTACAACCCGGTCAGCCTCGATCCGCAGGGCATGGGCGA 
AGAGGCCGAGCTGCTGGAGACCGAGCGCAAGATCGCGCCATACGCCGGCGCCGCCGTGCATGTGAAGTTCCGCACACTGA 
CCGGTCACCCATTGCTAATCCAGGCCCAACTCGCCGACGGCAGCGCGCTACCGCTAGGGGCCAATGTGCTCGACAGCCAG 
GGTGTGAACATCGGCATGGTCGGTCAAGGCGGCCAGGTCTATGCCCGCGCCGAGGGCGACAAGGGCCGCCTGCGCGTGCA 
ATGGAGCGAACGCCCAGGGGACGCCTGTCTGCTGGATTACGACCTCGACACTGGCCCTCGCCAGGCTATCGAACCCGGAC 
AGG CGG TG ATC CGCCTG CAG GG C ACCTG C ACG C CCGTCTCGG AGG C ACCATGA 



Protein sequence: (SEQ ID NO: 145) 

MFCHVEARRTGKLPIALGGLALAFAGLANGEAQYRFDDSL^ 

IEFRPRAGGVKPCFGERFLRRTLGVRPASEAGVQAPGDCLGLEERLPGSTFNLDTALLRLDLSVPQALLDIKPRGYVGPD 
EWDAGSSMGFVNYDASFYRSSFDGVGGNGDSDYGYLGLSGGINFGLWRLRHQSNYSYSSYAGNTRSDWNSIRTYAQRAVP 
GLRSELTLGESFTEGNLFGSLGYRGVRLASDDRMLADSQRRYAPQVRGTANSNARWISQNGKKVHESAVAPGPFVINDL 
YG T A YDGD LD VQ V 2 E ADG S VSR FS V P F S AV PES MR PG I S R YS ATLGQ ARQ YGDGNDLFGD FT YQRG LTNS LT ANLG S RLA 
EDYIALI^GGVIATPYGAFGFNSIFSHATVENGQRKQGWRVGI^YSRTFQPTQTTLTLAGYRYSTEGYRDLGDALSARHA 
DEHNDSVWSSSYKQRNQFTLLWQG1X5GYGNLYLSGATSDYYDGKSRDTQLQFGYSNTWRQLSYNIAYSRQQTTWYRDLN 
DDYDPSLPPQYNLRHGSERSNTLTLTLSMPLGSSSQAPNLSAMASRRSGDSRGSSYQTGLNGTLDEDRSLSYAIAAGRDS 
DNHGSDFNGSLQKQTSVATLNAGYAENSSYRQLNTGLRGAAVLHRGGLTLGPYVGDTFALVEAKGASGAGVRGGQGARVN 
GNGYAWPSLSPYRYNPVSLDPQGMGEEAELLETERKIAPYAGAAVHVKFRTLTGHPLLIQAQLADGSALPI^ANVIjDSQ 
GVNIGMVGQGGQVYARAEGDKGRLRVQWSERPGDACLLDYDLDTGPRQAIEPGQAVIRLQGTCTPVSEAP 
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FIGURE 22 
RL043 

DNA sequence: (SEQ ID NO: 22) 

ATGAATACTTTTCCACTGCCTCCGCTCCGTGCGGCTACGCTGGCGCTCGCCCTGCTGATACCCGCCATCCCGGCTCAAAG 
CAGCGTGGTGATCATCGGTACTCGCGTGATTTATCCCGGCGACGCCCGGGAAAAGACCGTGCAGATGATCAATCAGGACG 
CATTCCCCAACGTGATCCAAGCCTGGATCGACAACGACGACCCCTCCTCCACCCCGGAGACTGCAAACGCGCCCTTTCTG 
GTCAGCCCAGCGGTGACGCGCATAGCCCCCGGCAGCGGCCAGACCCTGCGCCTCCTGTATACCGGGCTCCCGCTGCCCGA 
GGATCGCGAATCGTTGTTCCATCTCAATGTGCTGCAGATCCCGCCCCGCGACCTGGCCAAGGCCGAGCGCAACCAGATGC 
TGCTGATGCAGCGCAGTCGACTGAAGCTGTTCTATCGCCCCGCCGCGCTGCTTGGCGGCTCGGAGCAGCTAGTCGAGCAG 
TTGCACTTCAGCCTGGTGCAGGCGAGCGGCAACTGGCGTGTGCGGGTGGACAACCCCAGCGGCTACTACGCCTCCTTCGC 
CGGCGCGATGCTGAGCATCGGCGAACGTCGCTGGCGGC!'GCTGTCGAGCATGGTCCCGCCCAAAGGCCAGGCCGAGTGGG 
CGGCGGAACGCCCTTCGCCGCTCGCCCCAGGACCGGTCCAGTTGAACGCCCTCTTGATCAATGACTACGGCGCGCGAATG 
G AGGT CC AG CATGTTCTG C CACGTTGA 



Protein sequence: (SEQ ID NO: 146) 

mTFPLPPLRAATLALALLIPAIPAQSSVVIIGTRVIYPGDAREKTVQMINQDAFPNVIQAWIDNDDPSSTPETANAPFL 
VSPAVTRIAPGSGQTLRLLYTGLPLPEDRESLFH1JWLQIPPRDIAKAERNQMLLMQRSRLKLFYRPAALLGGSEQLVEQ 
LHFSLVQASGNWRVRVDNPSGYYASFAGAMLSIGERRWRLLSSMVPPKGQAEWAAERPSPIAPGPVQl^ALLINDYGARM 
EVQHVLPR. 
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FIGURE 23 
RL044 

DNA sequence: (SEQ ID NO: 23) 

ATGAAACCTCAAAGTACTGCCCTGACTATCGCCGCATTTCTCGCATTGCCGGGTATCGCGGCGGCTGCCAATACCATCAC 
CTTCCACGGAGAAGTGACCGACCAGACCTGTTCCGCCGTCGTCGACGGACGAACCGACCCGACCGTGATACTCGACACCG 
TACCGGTAAGCGCTCTTGACGGCGCAGTCGGCAAACCCGCCGGGGAAACCAGCTTCACCCTGCAACTGACCGGTTGCGCC 
GCTCCGGCGGCCGATGCCGAGGAGCACTTCAGCGTGATGTTCCAGGCGGTCAATCCGACCAGCGCCGGCAATCTGACCAA 
TACCGCGTCCGCCGGCGCCACCGGCGTAGCGCTGCAGCTACTGACGGCACCGGGCGGCAGCGAGGTCAATCTGGCCGGCG 
GGTCGGCCGTGGCTGCCGGTGACATCGTGCTCGCAGGAGGCGAGACCAGCACCAGCTACGACTATGCCGTCCGCTACATC 
TCCGAAGCGACCACCGTCACTCCGGGACCGGTGCTCGGCTCGGTGACCTACACCCTGCGTTACGAGTAA 



Protein sequence: (SEQ ID NO: 147) 

MKPQST ALT I AAFLALPG I AAAANT I T FHGEVTDQTC S AWDGRTD PTVI LDTVPVS ALDGAVG KP AGETS FTLQLTGCA 
APAADAEEHFSVMFQAVNPTSAGNLTNTASAGATGVALQLLTAPGGSEVNLAGGSAVAAGDIVLAGGETSTSYDYAVRYI 
— SEATTVT PGPVIX5SVTYTLRYE - 
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FIGURE 24 
RL045 

DNA sequence: (SEQ ID NO: 24) 

AGTCCGCACGGTAGTGACGACTGGAAGCGCTTCTGTGCTGCCAACAACCTGGAGCCCAGCATGAGCCGGCGCGGCAATTG 
TTGGGATATGCCGTGGCGGAATCCTTCTTCAGTAGTTTGAAGAAAGAGCGTATCCGCAAACGCATCTACAAAACCCGAGA 
CATGGCCCGGGCGGATGTTTTTGACTACATCGAGGTCTTCTACACCCGAACCCGGCGGCACAGTCATCTGGGTGGCGTCA 
GTCCCG AGGC CTTTG AAAGCGCCTCG 
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FIGURE 25 
RL046 

DNA sequence: (SEQ ID NO: 25) 

ATGGCTGAAGTCACTCAACGAGCAGAGCAGCAACAAGAGAGCCAGAAGACCCTTCTCGGCACCATCATCAGTACGCCCTT 
CCAATTTCTCGGCGTGATGTTCGGGTCGCTGATCGGCGCAATCATCGTGGAGTGGGTTTGCCTGTATTTCTTCTGGCCTG 
ACGCGGGCTGGAAGCATGCCCAGGCCATGTTTGAGTACGAACTCAGTTGGCTGTCGCAGGGGCTGCTACACAGCGTCGTC 
GTGC AGG AGC CAGGT CGAACCG C C AC CTGGCTG G C C C AG TTGGCCTATGACTGGTTGTTCGTG AAG ACCGGGATGGTCG A 
CTGGATGACCAACATGACTACCATCGCGCAGGCCGGGCCACGGAGCCCGCTGGACGTTCGCTATCTCACCGCCCAGGGTG 
TCTCCACGCTGCAGAACTACGGCCTGGCCGCGCTGTACACGGTGCTGACATTCGTCGTGCGCCTGGTGATCCTGGTCATG 
ACGATCCCGTTATTCGTGATGGCCGCGTTCACCGGCCTGGTGGACGGCCTGGTGCGCCGGGACCTGCGCAAGTTCGGCGC 
CGGCCGGGAGTCCAGCTACCTCTACCACAAGGCGCGCGGCAGCATCATTCCGCTAGCGGTCGTCCCTTGGACGCTCTACC 
TGGCAATTCCCATCAACATCAATCCCCTGCTCATCCTGTTGCCCTGCGCCGCACTGCTCGGCGTAGCGGTGTGCATCACA 
GCATCCACCTTCAAAAAGTACCTATAG 

Protein sequencejiSEQ ID NO: _1 48) _ . 

MeWqRAEQQQESQKTLLGTI I STPFQFLGVMFGSU GAI I VEWVCLYFFWPDAGWKHAQAMFEYELSWLSQGLLHSW 
VQEPGRTATWLAQLAYDWLFVKTGMVDWMTNMTT I AQ AG PRS PLDVRYLTAQGVSTLQNYGIAALYTVLTF WRLVI LVM 
TI PLFVMAAFTGLVDGLVRRDLRKFGAGRESSYLYHKARGSI I PLAWPWTLYLAI PININPLLILLPCAALLGVAVCIT 
ASTFKKYL. 



Figure 25 



1 



Title- VIRULENCE-ASSOCIATED NUCLEIC ACIDS AND \ 
PROTEINS AND USES THEREOF \ 
Applicants: Laurence Rahme et al. 

Filing Date: September 12, 2003 Serial No.: Not Yet Assigned 
Page 42 of 118 Customer No,: 21559 

FIGURE 26 
RL047 

DNA sequence: (SEQ ID NO: 26) 

ATGGCTGGCCAGTACCCGTTGGAAGCGCTCTTGCGGCCTGCCGTGGAGCTCTACACCACCACCGTGTGCTTCACCGCAGC 
CGCGCTCTG C ATCGTCGCGCCGTGG ACGTTCTCCCTCACTCCGCTGTTCGG CATCGTGGCCG CG CTGTGCTTCG CCTGGC 
TGGGTATCGTGCGGCTGAAGCAGGCCGGCGTGGTGCTCCGCTACCGGCGGAACATTCGCCGACTGCCGAAGTACACGATG 
ACCAGCGCCGAGATGCCGGTCAGCAACGAACACCTGTTCATCGGTAAAGGATTTCGCTGGACGCAGAAGCATACGCAGCG 
CCTGGCAGATACCTACCTGCCCCAGTTCGCCTCTTACGTCGAGCCCTCGCCCCTCTACGAGCGCGCGCGCCGGTTGGAGA 
AGCAGCTCGAGTTCGCCCCCTTCCCCCTGAAGCTGGTCGCCAAAGCCACTGCCTGGGACGTGGCCTGGAACCCCGCACGG 
CCGCTGCCGCCCGTGGGCGGTTTGCCTCGGCTCCATGGCATCGAGCCGCGCGAACAGGACGTAGGCCTGCAACTGGGCGA 
GCGCGTCGGCCACACACTGGTACTCGGCACCACGCGGGTGGGTAAGACGCGCCTCGCGGAGCTGTTCATCACCCAGGATA 
TTCG CCG CACTCACTG CCGGGT ACG ACG CCGG CGGGTG AAG ATGGGCCGG CGG ACCC AG ACGGTTCACCACGGCTATCGG 
CGCCGGCGCGCAGAGGAGCAGCCGGACTACGAGGTGGTGATCGTCTTCGACCCGAAAGGCGACGCTGACCTGCTGAAGCG 
TATGTACGTGGAATGCGAACGTGCCGGCCGCCTGGACGAGTTCTACGTGTTCCACCTCGGTCATCCTGACCTGTCGGCAC 
GCTACAACGCCGTCGGCCGGTTCGGTCGGATCTCCGAGGTCGCCACCCGCGTCGCCGGCCAGCTCTCCGGCGAGGGCAAC 
AGCGCGGCGTTCCGCGAGTTCGCCTGGCGGTTCGTCAACATCATCGCCCGCGCGCTGCACGCGCTGGGTATCCGGCCTGA 
CTACCAGCAGATCCTCCGGCACGTCGTGAACATCGATGCGTTGTTCGTCGAATATGCGCAGAAATACATCAGCGAGCACG 
ATCCCAGGGCCTGGGACACCATCATCCAGATCGAGGGCAAGCTCAACGACAAGAACATCCCGTTCAACATGAAAGGACGG 
CCCCTGCGGGTCGTAGCCATCGACCAGTACCTGACACAGAAACGCATCGCCGACCCGGTCATGGAAGGCTTGAAGAGCGC 
CGTGCGCTACGACAAGACCTACTTCGACAAGATCGTGGCCTCGCTGCTGCCGCTACTGGAGAAACTCACTACCGGGCGGA 
TCTCGGAGCTTCTTTCGCCCAACTACGCGGACCTCAACGATCCGCGGCCGATCTTCGACTGGATGCAGGTCATCCGCAAA 
CGCGCCGTGGTCTACGTCGGCCTCGACGCACTATCGGATACCGAGGTCGCCGCCGCGGTGGGCAACTCCATGTTCAGCGA 
CCTGGTCTCGGTAGCGGGTCACATCTACAAGCATGGTGTCGATGACGGCCTGCCCGGCTCGCTCGCCAGCGGCAAGGTCC 
GCATCAACCTGCATGCCGACGAGTTCAACGAGCTGATTGGCGACGAGTTCATCCCCATGGTCAACAAAGCGGGCGGCGCC 
GGCGTGCAGGTGACGGCCTACACCCAGACCATGAGCGACATCGAGGCCAAGATCGGCTCCCGCGCGAAGGCCGGTCAGAT 
CATCGGCAACTTCAACAACCTGTTCATGCTGCGGGTGCGCGAGACCGCCACGGCCGAACTCCTTACCAATCAGCTCCCCA 
AGGTCCAGATCTACACCAGCACGCCGGCGAGCGGCGCCAACGACGCGATCAACAACAACAAGAAGGTAGCCTTCACCTCC 
AGCTCGCACGACCAGGTGCAGATGACCAGCGTGCCGATGCTCGAGCCGGCCCACATCATTGGTCTGCCCAAAGGACAAGC 
GTTCG CG CTACTCGAGGG CGG CAATCTCTG G AAG ATCCGAATGCGG CTGCCGGCGGTCGCCCCCG ACG AGGTGATGCCGA 
AAAGCCTGCAGGAGCTGGCTGCCGGTATGCGCAAGGGCCAGGCCGCCAACAGCGAGTGGTGGGAGGCGCCGGGATACTCC 
GCCCTGCAGGATGGTCTGCCCCAGGACCTGGTCGACGATTTCCGTCACCTCGGCACCGGTGAGGATGCCGCCTGA 



Protein sequence: (SEQ ID NO: 149) 

MAGQY P LE ALLRP AVE LYTTTVC FT AAALC I VAPWT F S LTPLFG I V AALCF AWLG I VRLKQ AG WI»RYRRN I RRLP KYTM 
TSAEMPVSNEHLFIGKGFRWTQKHTQRIADTYLPQFASYVEPSPLYERARRLEKQLEFAPFPLKLVAKATAWDVAWNPAR 
PLPPVGGLPRLHGIEPREQDVGLQLGERVGHTLV1X3TTRVGKTR1AELFITQDIRRTHCRWRRRVKMGRRTQTVHHGYR 
RRRAEEQPDYEWIVFDPKGDADLLKRMYVECERAGRLDEFYVFHLGHPDLSARYNAVGRFGRISEVATRVAGQLSGEGN 
SAAFREFAWRFVNI I ARALHALG I RPDYQQI LRHWN I DALFVE YAQKYI SEHDPRAWDTI I QI EGKLNDKNI PFNMKGR 
PLRWAIDQYLTQKRIADPVMEGLKSAVRYDKTYFDKIVASLLPLLEKLTTGRISELLSPNYADUTOPRPIFDWMQVIRK 
RAVVYVGLDALSDTEVAAAVGNSMFSDLVSVAGHIYKHGVD^ 

GVQVTAYTQTMSDIEAKIGSRAKAGQIIGNF1WLFMLRVRETATAELLTNQLPKVQIYTSTPASGA1TOA1N1WKKVAFTS 
SSHDQVQMTSVPMLEPAHIIGLPKGQAFALLEGGNLWKIRMPLPAVAPDEVMPKSLQELAAGMRKGQAANSEWWEAPGYS 
ALQDGLPQDLVDDFRHLGTGEDAA 
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FIGURE 27 
RL048 

DNA sequence: (SEQ ID NO: 27) 

ATGACTACTCATCTGATCACCCTAGTCATCAAGCAGCCGAGCGACGCTCAGGCACGCCAACTCATGTACCAGGAGTTGCT 
CGG ACTG ATCTCACGCTACGGCGG TG AGGTG ACG TC C AAGG C CTTGG AGG ACG AGTCG ACCCTCTGCGAG CTG CTGGTGC 
AGATGCTGCCTGATCATGAGGTAGAGCAAGCCAGGAAACAGGTGCTCGAACTTCATGCCAAGGGCCGCCTGCAGGCGCCG 
GCAAGCCTGAAGGTGTAA 

Protein sequence: (SEQ ID NO: 150) 

MTTHLITLVIKQPSDAQARQLMYQELLGLISRYGGEVTSKALEDESTLCELLVQMLPDHEVEQARKQVLELHAKGRLQAP 
ASLKV 
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RL049 

DNA sequence: (SEQ ID NO: 28) 

ATGAAGAAGTTCCTTGCCACGCTGGCATTTTGCACGGCGTTCGCGACTCAAGCCTGGGCCGCCGGGCTGATCGTTGTCGA 
AGACCTCGGCGGCGCCTCGGCGCTCCCCTACTACCAGGGCCTGGATCCGCAGCCATCCGCTTCCGCACCAGGACCTGGCX3 
ACCTGGGCGTCCGTGGCTCAGGTGCGTTTCCAGTTCGCTCCGCCCGCCTATCGCCAGGACGGGTCCAGGGGCGCGCCATC 
AACGCT C C AGG C CTG C AACTGCTGTTC CTGGTCGG CG ACG ACACG CTGTCTCG AACCTGGCTG AAAG AG CG AGG CG ACGA 
GCTTCGAGACCTCCAAGCCGTGGGCCTGGCAGTGAACGTGGCCAGCGAAGCGCGCCTGACGGAAATCCGGGCCTGGGGGA 

AAGGACTTCAGATATTGCCGGCGCCGGCGGACGACCTGGTCGACCGGCTAGGGCTGCAGCATTACCCCGCCCTCATCACA 
TCCACCG C CATC CAG C AG TAG 



Protein sequence: (SEQ ID NO: 151) 



MKKFLATLAFCTAFATQAWAAGLIWEDLGGASALPYYQGLDPQPSASAPGPGDLGVRGSGAFPVRSARLSPGRVQGRAI 
. NAPGLQLLFLVCSDpTLSRTWLKERGDELRDLQAVGIAVNVAS EARLTE I RAWGKGLQI LPAPADDLVDRLGLQHYPALI T 
STAXQQ . . 
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FIGURE 29 
RL050 

DNA sequence (SEQ ID NO: 29) 

ATGGCAACGTCTGTAGTTCGAGCCCTCCAGTTGGCCACCCTGCTGGTCCTGGTCAACATCGCTCAGGCCGCCGTGGATCC 
ACCGCCGGCGTACAAGCAAATCGCCCTGCCCAAAGGGGTTCCGGCCGAGGTGCTCTACTCGGTCGCGCTGACCGAGAGCA 
AGGTCCTGCTGCGCGGCGAATACGTTCCCTGGCCCTGGACATTGAACGTCGCCGGGAAATCTTACTACTACGCGACCCGC 
ACCGCCGCCTGCACAGCGCTACTCGCGGCGATCAACCTCTACGGGGCCAAGAGCGTCGATTCCGGCCTCGGCCAGGTCAA 
CATCGGCTGGAACGGACATCGTTTCTCCAGCCCCTGCGAGTCCCTGGATCCGTACAAGAACCTGGACGCCACCTCCGACA 
TCCTGATCGAGCAGCGGGACGCCCTGTATGCATCCGCCCCGGGAAGACCGGTGGACTGGATCCAAGTTGCCGGCCGCTAC 
CACCGCCCCGCCGGCGGCGCGCCTGCCGCCAAATACCGTAGGACGGTTTCCCGCCACCTTAGCCAAGTTCTCGGCGTCAA 
CCTACTGG'TGACCAATCCATGA 

Protein sequence (SEQ ID NO: 152) 

MATSVVRALQLATLLVLVNIAQAAVDPPPAYKQIALPKGVPAEVL 

TAACTALLAAINLYGAKSVDSGU3QWIGWNGHRFSSPCESLDPYKNLDATSDILIEQRDALYASAPGRPVDWIQVAGRY 
HRPAGGAPAAKYRRTVSRHLSQVLGVNLLVTNP 
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FIGURE 30A 

RL051 

DNA* sequence : (SEQ ID NO: 30) 

ATGATCAGAACCGTATCGCTCCTGTCCGGCCTGATGCTGCTGCTGAGCTATCCCGCAGCCGGCCAGGAGGCGGCGGCAAG 
CCGAGAGGCCAGCAGCCAACTGTCCGGTAGCCAACTCGGCACGCTGAAACAACAGACATCTCAGAGCGACCTGGCCCAGG 
AGTGGGGACTGAACCAACAGGAATGGACCCGCTACCAGACGCTCATGCAAGGCCCCCGGGGCGCTTACTCGCCTGGTATT 
GATCCGCTGACCGCGCTGGGCATCGAGGCGCGATCGGCAGAGGAACGGCGGCGGTATGCCGATCTACAGGTCCAGGCCGA 
ACGGCGCCGGGTCGAGAAGGAACTCGCCTACCAGCGCGCATACGACGAAGCCTTCGCCCGCGCCTATCCAGGCGAGGGGG 
TGATCCGCCTCACCGAAAGCAGCACAGCCAACCCGTCGGGCACGCCGAACATGAGCCCAGCGTTGCAGAGCAGCGGGCGC 
CTGG CC CTG TTCGTCCAGG AC AACTG C AC CG CCTG CATCCAGCGGGTCCG CG ACCTG C AAC ATGCAG AAAAGG AGTTCG A 

CCTCTACTTCGTCGGTAGCCAGAACGACGCAGAGCGAGTGCGGCGCTGGGCAATCCTCGCCGGCATCGACCCGAAGAAGG 
TTCGCAGCAAGCAGATCACGCTCAATCATGACGAGGGCCGCTGGATGGCCCTAGGACTGGGCGGAGCCCTTCCCGCCCTG 
GTACAGGAGGTGAACGGCCGATGGCAACGTCTGTAG 



Protein sequence? (SEQ ID NO: 153) 

M I RT V S LL S G LMLLLS Y P AAGQE AAA SREASSQLSGS QLGTLKQQTSQ SDLAQ E WG LNQQ E WTRY QTLMQG PRG AYS PG I 
D P LTALG I-E ARS AE ERRR YAD LQVQAE RRR VE KE-IAY-QRA YDEAFARA Y-PGEG VI RLT-E S S TAN P SGT-PNMS P ALQ S S G R 
: ^£FVQDNCTACIQRVRDI^^ 
VQEVNGRWQRL. 

RL052 

DNA sequence: (SEQ ID NO: 31) 

ATGAAACGCCCATCCCCTGCATCAATGATTCTTGGCCTCTGTTTGACGGCAATGGCCGGCCTGCTGAGCTACCAGCAGTA 
CCAACTCGTTCAGCTCCGATCAGGCGTGGACAGTGCCGCGGAAAAGGCCTCGCTGGAGGCGATCCTGGCTCGCTTGAGTC 
GAGTCGACGAGCGCCTCGACGCCGTGGATGGACAGCACCTGGTCAGCAACGAGGACTTCCGTTCAGGCCAGCAGGCGCTG 
TCCAACCGAATTGACGCTGCGCAGGCGTTCGCCAAGCAGGCCTCCGATGCCGTCGAGAACCTGGCTCAGACCACCGCCTC 
GGCCGGCGACCTCTTGGTGCTCAAGGCAACCGTGGAGACACTGGACGGTTCTGTCCGCACGCTTCAAGAAAAGCAGGCCA 
AGGCGCCGCCGCTGATCGTGCCAGCGCCAAAACGCCCCATACCCGCCAAGCCCAAGCCGAAACCCAAACCGATGGAGCCC 
CCGCCCTTCTCGATCCTTGGCGTGGAGTATCGCGGGGGAGAACGGTTTCTGTCGGTTGCACCTCCGGGATCCACCCAGCT 
CAGCCAGATCTACCTCATTCGCCGGGGAGATGCCGTCGCCGGCACGACCTGGCGACTGACCGACCTTGACGATGGTACCG 
CGCACTTCGACGTCGCCGGCACCTCGCGCAGCGTTCGCATCCAACCATAG 

Protein sequence: (SEQ ID NO: 154) 

MKRPSPASMILGLCLTAMAGLLSYQQYQLVQLRSGVDSAAEKASLEAILARLSRVDERLDAVDGQHLVSNEDFRSGQQAL 
SNR I D AAQ A F AKQ AS D AVENLAQTTAS AG DLL VLKATVETLDG S VRTLQE KQAKAP PL I V P A P KR P I P AKPKPKPKPME P 
P P FS I LG VE YRGG E RFLS VA P PG STQL S Q I YL 1 RRG D A VAG TTWRLTDLDDGTAH FD VAGT S R S VR I Q P . 

RL053 

DNA sequence: (SEQ ID NO: 32) * 

ATGCCGCGCCGCTTGATCCTCTCGGTCACGGAGCGGGATATCCTATiTGCACTGCCGGTAAGCCGAATGACCTCACTGAC 
TACTCCACCTCAACGAGTCCGCCCCATCGTCGATCCGCCAGCGACGCGGCGATGCCAATCACTTGGTTTTTTCGGTGCAG 
GTCAGCCTGCTGTGCTATCCAGCGTTCAGCCCTGATGCGCGACGAAGAGCCCCCGAG 

RL054 

DNA sequence: (SEQ ID NO: 33) 

ATGGCCGAAGCTATCAGAAAGGATGCAATGATGACAAAACTCTACTTTGATCTTCTGAACTCGCCTGCCGAGGCTCATTC 
GTCGATACAAAAGTCTTTATCTGTGCAGGCAATCTCCACAACTGTCCCAATACTGGAGTTTCCTTCGGAAACCGTATACG 
CCTATGCATCGTACATAAATGCATTAAGTATCGGTCAACGCATAGATCCTGCATTCACCCAGAGCTTAACGAGTGCCATA 
TCCAACCTGGCAGGTCGCCCGATTGCAGTAAGCGACATTTACCAAAAAATTCATGAAACCACACTGAGAACACCTGTTGA 
GATGGGCGTTCGTCCTAATAGCATCACCTTTGAGGAGTATCAGGCCACCATAAATCAGCAAGCCATCAACATGGTTCAAG 
ATATGCAGGATGGAGACAAAGGTGAGAAGGTGGAGGCCCTCCAGGCCAATATGCAGTTCCTGTATGGACAGGAGATAAAT 
ACTGATTTCATCGCTCGTAATGAACTCGCTGCTGGGCAGAGAGCGAAAACCGTCGCAATAGTTCAGGGGCATATCACCAT 
CGGGTACGGCTTCGATACCTTCGTGCATGAAGCGTCCGAGCTAAACTCTTTGAATCTTGTTGGTTCTACGCGACAGAAGG 
TATTACCTGCATTGCAGCTATCAACGTCCGACCCAGGCTTCTGGAGCGTCTATGCCTTGCTGGGACAAAGTCTCACGGAT 
GACGATGGGCTATTACTCTTTAGTGCCAAAGCGCGAGCTGTTGTTCAACGCATAGCAAGCAACCAGTTTGCAGGTAAGTG 
GAATGGGCTACCCCCAGCTATCAAAACGGTTGCGCTTGATCTATATTATCAATATGGGCAGACTGGTAATTTTCCAAAAT 
TTCAACAAGCTATAAATAGCCATGATTGGCCGGCAGTCATCCATGAACTTAGAAACTGGAATGGTGTACCGAATGATCCT 
CTCCAGTTCATTACAAAACGATTGGAAGAGCGAGCCAAGTATCTGGCAATATCCTTCAACTATGAGCAATGA 

Protein sequence: (SEQ ID NO: 155) 

MAEAIRKDAMMTKLYFDLLNSPAEAHSSIQKSLSVQAISTTVPILEFPSETVYAYASYINALSIGQRIDPAFTQSLTSAI 

SNLAGRPIAVSDIYQKIHETTLRTPVEMGVRPNSITFEEYQATINQQAINMVQDMQDGDKGEKVEALQANMQFLYGQEIN 

TDFIARNEIjAAGQRAKTVAIVQGHITIGYGFDTFVHEASELNSLNLVGSTRQKVLPALQLSTSDPGFWSVYAL 

DDGLLLFSAKARAWQRIASNQFAGKWNGLPPAIKTVALDLYYQYGQTGNFPKFQQAINSHDWPAVIHELRNWNGVPNDP 
LQF I T KRLE ERAK YLA I S FNYEQ . 
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FIGURE 30B 

RL655 

DNA sequence: (SEQ ID NO: 34) 

ATGAACAACACAGTGAGCGAAACGCAACAGATCAATATTTACCAAAATCCGGGGCAGTCTATTTCCGGTCTCTACAAGGG 
GCTGGCTAACCAGTGCTCTCCTGGCCAGCCATTTCCAGAGGTACAGCTTGTGGAGGCTTGGGATATCCCTCTCGTACTCC 
ATCCGGAGTTTGTGCCTAACGGAGATGTCTCGAAAATCGATAAGGAGTACGGAACGATCCTTGCTGCTGAGTCAGCTCAG 
GTTATCCTGCTTCAACTCCAAATGGCTCAAGACAAGGCTAAGGCGTGCGGGGAGGTTACAGCCTTGATCAGTTCTGTCTC 
CTCCAATCTCAATACCATTAAGAGTCGTCATGGTGCTAATTATCTAAACCTGCTGAAACAATCACCGAACCGATACCCGA 
CTAGCGTCGGAGTTGAGATCATGTCAGGTGGCAGTCCGAACCAGGATTCTGGAATCGAGGTCTCTTACGGTGCCAGTCTC 
GGCCGTCTAACTCAATCACAACTTCAGGCGATGAATCTGCCTGCCAGTCTCAAACAGTTGCTCACTCAGGGAATCGGTGT 
GAAGCTTTCTCAGCCTGAATATTGGCCTGCTTACAACAACATAGCCACTGGTATTCGTTATACAACCGGAGTGGCGATAA 
CGTTGGCCTATTGGGCCACGGTTTAG 



Protein sequence: (SEQ ID NO: 156) 

MNNTVSETQQINIYQNPGQSISGLYKGU^QCSPGQPFPEVQLVEAWDIPLVLHPEFVPNGDVSKIDKEYGTILAAESAQ 
VILLQLQMAQDKAKACGEVTALISSVSSNLNTIKSRHGANYLNLLKQSPNRYPTSVGVEIMSGGSPNQDSGIEVSYGASL 
GRLTQSQLQAMNLPASLKQLLTQGIGVKLSQPEYWPAYNNIATGIRYTTGVAITLAYWATV 

^D^ sequence: (SEQ ID NO: 35) ^ ' ' " 

ATGACCCAAGCTGCGAAAATACCAGCAAATGAGTACTCATTGGGGGATGGAAGAGGCTACATCAATATCTGGCCGGAAAA 
GGATGAGGCTCAGGCATTTCTTATCCATAATGATGGGCCTAATGGGGCTACATGCAGCCTTAAAGGCACTCTTAGAGATA 
ATAAAGGAGTGGTGCATTCGCCGTATTCCTCTGCTTCATGTTTGCTAAGTATCACCCAGACAGGGCTGCTGTCAGTAAGC 
GTCAAACGTGAGGAAAATTCGCCAAGCTGCTCTGCATGGTGCGGTCCTAGAGTTTGGTTTGAAGGAGCCTATAGCGTCCC 
GCCCAAGGGCTGCTACTATATGCAAATAAGGAAAAAAACTCGACAAATGTTGGGTATGATTGAGAAAAAAGAGCTTGATG 
CCGCTCGCGCCTTATCAAATAAGCTTTTGTCAGACTGCGCAACCGAGCTAGCCTATCCTGCCAAGATATACTTGACGAAC 
A CACTTG CCATG AT CAGTG CTG AAAAGGG AG AG AATG CTCGCTGTTTG G AG TATGCCCATCGGG TG C AAAAGCAAATTCC 
TGTAAGAG ATG ACGG CCAAC CGG CTG AAG A CTTGCTCC CGG CGG AGCACGCTTTCGCT ATGG AACAACGCG CCAAGGCTG 
ATGCTCTGTCTGAGCGATGCAGCGACGAGAAATAA 



Protein sequence: (SEQ ID NO: 157) 

MTQAAKI P AN E Y S LGDGRG Y I N I WPE KD E AQAFL I HNDG PNG ATC SLKGTLRDNKG WH S P YS S AS CLLS I TQTG LLS VS 
VKREENSPSCSAWCGPRVWFEGAYSVPPKGCYYMQIRKKTRQMLGMIEKKELDAARALSNKLLSDCATELAYPAKIYLTN 
TLAMISAEKGENARCLEYAHRVQKQIPVRDDGQPAEDLLPAEHAFAMEQRAKADALSERCSDEK 

RL057 

DNA sequence: (SEQ ID NO: 36) 

GTGCTGGTAGAGCGTTTGCCGACTGATGTTGAATTCGCGGGCGAGCTGAGCCTTGGGCTCGCCGGCCGCTGCCCGCAGCC 
CCAGGGTAGCACCTGCTTGTCGGACAAGGCCTCTTTGCGGCCCCGGTACGCGCAGAGCTTGATATCCTCGCGCTACCGCG 
CTGGTGCTGCTTGTATGCTGCTCAGCAAGCCAGCCGCCGGTCTTTTCAGGGTAAGCGTACGGCCAATACACCTTTACTTA 
GGTTGA 

Protein sequence: (SEQ ID NO: 158) 

VLVERLPTDVEFAGELSLGLAGRCPQPQGSTCLSDKASLRPRYAQSLISSRYRAGAACMLLSKPAAGLFRVSVRPIHLYL 
G 

RL058 

DNA "sequence: (SEQ ID NO: 37) 

ATGG ATATTCGC CTG G AG ATTTT AGCG CTTG AACAGCTGTTG CTAG AG CCGG AATCG AGAAAGAATGAT CGACTGCTTAA 
ACAGCTGCTTACCGAAGACTTCGTTGAATTTGGAGCTATCGGCAAAAGCTGGACGAAAGCGGAGGTGATCGTGGGACTAA 
AATCCCAGACTTGGATCAAAAGGACAATCGAGGATTTCAAACTGCGTGTGCTTGCAGATGGTGTCGCGTTAGCAACGTAC 
CGATGCCGTCATCAAAATGCTAATGGCGATGAGTCGTTATCAATGCGTAGCTCTGTTTGGAAAACCTACGAAGATGGTTG 
GCACATGGTGTTTCACCAAGGCACGAGGGTCTCCGAGTAG 

Protein sequence: (SEQ ID NO: 159) 

MDIRLEIIALEQLLLEPESRKNDRLLKQLLTEDFVEFGAIGKSWT 

RCRHQNANGDESLSMRSSVWKTYEDGVmMVFHQGTRVSE. 
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RL0|9 

DNA sequence: {SEQ ID NO: 38) 

ATGACTTCCTCGCCCAACCTTGACCAGATGACCCCGGAACAGCTTCGTGCCTTGGCGGCACAGGCGTTGCAGTTGCAATC 
CCAGGTCGAGGCGATGAGCAGGAAAATCCGCAACAATGAAACCCTCATCGAACAGTTCAAGTTCGAAATCGCTCTGCTCA 
AACGCCACAAGTTTGCCAAGCGCAGCGAGCAAATCAGTTCGGCGCAAGGCAGCTTGCTGGATGACCTGCTCGACACCGAC 
CTTGAAGCTATCGAGGCCGAGCTGAAACAACTCCTTCCAGCTTCGCCACAAGCCGAGCCACGGCAATCCCCGAAACGTTC 
GCCATTGCCGCCGCAGTTCCCGCGCACGGTGATTCGCCACGAACCTGAAAATACCCAATGCGCCTGCGGCTGCCAACTTC 
AACGCATCGGCGAAGACGTCAGCGAGAAGCTGGATTACACGCCGGGCGTGTTTACCGTCGAGCAACATGTGAGGGGCAAA 
TGGGCCTGCCGTCAGTGCGAAACCCTGATCCAGGCGCCGGTGCCAGCCCAGGTTATTGATAAAGGCATCCCGACCGCAGG 
TTTGTTGGCCCACGTGATGGTGGCCAAGTTTGCCGATCACTTGCCGCTGTACAGACAGGAAAAAATCTTTGGCCGCGCCG 
GGCTGCCAATTGCCCGCTCGACCCTGGCGCAGTGGGTCGGACAAACTGGCGTGCGGCTTCAGCCACTGGTCGATGCACTG 
CGTGAAGCCGTGCTGAACCAGGACGTGATCCACGCCGATGAAACACCGGTGCAAATGCTTGCACCAGGCGAGAAGAAAAC 
CCACCGGGTCTATGTCTGGGCCTACAGCACGACGCCGTTTTCGGCGCTCAAAGCGGTGGTTTACGACTTCAGCCCAAGCC 
GTGCCGGAGAACATGCACGCAACTTCCTAGGCGACTGGAATGGCAAGCTGGTCTGCGACGACTTCGCTGGATACAAGGCC 
GGTTTTGAACAAGGCATCACTGAAATCGGCTGCATGGCTCATGCTCGCCGCAAGTTCTTCGACCTGCATGTCGCTAACAA 
AAGCCAACTGGCCGAACAGGCGCTGCACTCAATTGGCGGTTTGTACGAGGTTGAACGCCAGGCTCGGGACATGAGCAACG 
AAGACCGTTGGCGAATACGTCAGGAAATGGCGGTACCGATCAGCAAAACACTGCATGACTGGATGTTGGCCCAGCGCGAC 
CTGGTGCCCAACGGCTGGGCCACAGCTAAAGCCCTCGAGTACAGCCTGAAACGCTGGGGAGCGCTGACGCGGTACCTGGA 
"GGATGGGGCTGTGeGCATCGACAAGAATGAGGTGGAGAACGAGATAGGGGGGTGGGCGCTCGGACGCTCGAACTGGTTAT 
TTGCCGGATCGCTGCGCAGTGGCAAACGAGCAGCAGCTATCATGAGCCTGATCCAGTCCGCTCGCATGAACGGGCATGAT 
CCGTATGCCTACCTGAAGGACGTGCTAACTCGCCTGCCGACGTTACGGTCGAAAGACATCAGCCAGTTGCTGCCGCATCA 
GTGGGTACAGATCTAG 

Protein sequence: {SEQ ID NO: 160) 

MTSSPNLDQMTPEQLRALAAQALQLQSQVEAMSRKIRNNETLIEQFKFEIALLKRHKFAKRSEQISSAQGSLLDDLLDTD 
LEAIEAELKQLLPASPQAEPRQSPKRSPLPPQFPRTVIRHEPENTQCACGCQLQRIGEDVSEKLDYTPGVFTVEQHVRGK 
WACRQCETLIQAPVPAQVIDKGIPTAGLLAHVIWAKFADHLPLYRQEKIFGRAGLPIARSTLAQWVGQTGVRLQPLVDAL 
REAVLNQDVIHADETPVQMLAPGEKKTHRVYWAYSTTPFSALKAWYDFSPSRAG 

GFEQGITEIGCMAHARRKFFDLHVANKSQIAEQALHSIGGLYEVERQARDMSNEDRWRIRQEMAVPISKTLHDWMLAQRD 
LVPNGSATAKAIJ)YSLKRWGALTRYLDDGAVPIDNNQVENQIRPWAIX5RSNWLFAGSLRSGKRAAAIMSLIQSARMNGHD 
PYAYLKDVLTRLPTLRSKDISQLLPHQWVQI . 

DNA sequence: (SEQ ID NO: 39) 

ATGATCCGCATCGATGCGATCTGGCTAGCCACCGAACCGATGGACATGCGCGCCGGCACCGAGACGGCATTAGCCCGGGT 
AATTGCGGTGTTCGGTGCGGCGAAGCCGCACTGCGCTTATCTGTTCGCCAATCGCCGGGCTAACCGAATGAAAGTGCTGG 
TGCACGATGGCGTGGGCATCTGGCTTGCCGCGCGTCGACTGAACCAAGGCAAGTTCCACTGGCCCGGCATTCGCCATGGC 
TGCGAGGTCGAACTCGACAGCGAACAACTCCAGGCCTTGGTGCTGGGCCTGCCGTGGCAGCGCGTCGGCACAGGCGGTGT 
GATCAGCATGCTGTAA 

Protein sequence: (SEQ ID NO: 161) 

MIRIDAIWIATEPMDMRAGTETAIJ^VIAVFGAAKPHCAYLFANRRANRMKV^ 
CEVELDSEQLQALVLGLPWQRVGTGGVISML . 

WM 

DNA sequence: (SEQ ID NO: 40) 

ATGCGCCAACGAAGCTCTTACCCGAAACCGTTCAAAGCCCAGGTCGTTCAGGAATGCCTGCAACCTGGGGCAACGGTGTC 
CAGTGTCGCCATCAGCCACGGCATCAACGCCAATGTCATCCGCAAATGGCTGACGCTTTATCGAGACCAGCCCGTACCAG 
CCTCGTTACCAGCCTTTGTCCCGCTGAAGGCCACCCCTAAACGGCCAGCCGAAACGTCAGTGCTCATTGAACTGCCCATG 
GCCGGGCAAATGATCACGGTGAAATAG 

Protein sequence: {SEQ ID NO: 162) 

MRQRSSYPKPFKAQWQECLQPGATVSSVAISHGINANVIRKWLTLYRDQPVPASLPAFVPLKATPKRPAETSVLIELPM 
AGQMITVK. 



Figure 30 



Title: VIRULENCE-ASSOCIATED NUCLEIC ACIDS AND 
PROTEINS AND USES THEREOF 
Applicants: Laurence Rahme et al. 

Filing Date: September 12, 2003 Serial No.: Not Yet Assigned 
Page 49 of 118 Customer No.: 21 559 

FIGURE 30D 

RL062 

DNA sequence: (SEQ ID NO: 41) 

ATGGCTTTATCTCTTATTCGTAGTCTCACTGCGTCCGCCTCACGAAACATCTCGGCGTTGAAACGCGATGCCAAACGCTT 
GCAGAAGAACTCCTTTCTTGTGTTTGGAACAGAATATCCACTCAAGGTTTGCCAAAATGCGGTAGCAGTTTCTCGCGGCT 
TCCGCTCACTCGCTGATGTCGATAAACTGGAGCAGCACATTGGCATGAATAGAAGCGCTCCATTCTGGGTGATCCGTGGC 
CGCAACGATACACACCAGGGGGTACTGGAAGCGCTATATTGTTTAGACGTTGAATATACCGAGAATGGCCCCGTCGTTTT 
TACTGGAAACCCAAAGCACTCTATACTTCCAGCCTTAGTCCTTTTTCTTGAGCAAATGAGCTTTAAGAAACTACCCGGAC 
TAATCCTCATCGAAACAAAAGAGACCTCAATCCAAACAACCCATATATTCGACGCAATAGAAAAATTAGAAGTCGAAGAA 
ACTCTAAATAAATTTCGATTTCTTGACTTGCGAGACCGAAACCTTCCCGTTTCGCTTAGTACCGAGGCTCGTTGCTGGAT 
CGAGTCAATTGTCAGTTTATTGCCAAACGACATCCAAGAGGAAATACGTAATAAAGGATGGTCAACTCACTTAGAGATCA 
GTGCATATGAGCATGCAAAGTCTCGTAATCAAGTATTTGGCTCCTCCAACTTCCCTTGCGTCCCCTTCCTCTCCATAAAG 
TCAGCGATCTATCAACTCATTTCAGGCGCATACCCTCCCTTATGGATGCAGCCATCCTCCTCTGGCGAAATATCTAAAGT 
TGATATACGCCGACCTCCTCTCGAAAAAAGCTCAGAGGAAACCTTACTTTATCTCATAAAAAAATTAGAGAATCGACAGT 
TCCACACAGGCATTTCATGTGAGCATGAGAGTCGATGGCGGCCGTATGTCGTACTCTTCTCCAGGAATGATCCGGCTAGC 
GAGGTACTAGCAGGAGTTATACACTCGTACTTTTCTTGGAAGCAAGATAGAGACCATCGCTCACCCACCCTTTATGTTTC 
AGATGGAGCAGTTCCCTATGCTCCCAAGCTTCTAGGTTTAGGCGGCCATACGGTCATTGCAAATGGAATCACTGAAATTC 
CCGACGGGGATGGTCTTGGGGAGTTCTATGGCTACAAGAACTCACTTAAAGTCAGCTCCTTATCTAACGGAATACAGTTC 
ATGGGTAAGCATGTATCACTAAAGTAA 

A. - .. - - - ■ 

Protein sequence: (SEQ ID NO: 163) 

MALSLIRSLTASASRNISALKRDAKRLQKNSFLVFGTEYPLKVCQNAVAVSRGFRSIJUDVDKLEQHIGMNRSAPFWIR^ 
RNDTHQGVLEALYCLDLEYTENGPWFTGNPKHSILPALVLFLEQMSFKKLPGLILIETKETSIQTTHIFDAIEKLEVEE 
TLNKFRFLDLRDRNLPVSLSTEARCWIESIVSLLPNDIQEEIRNKGWSTHLEISAYEHAKSRNQVFGSSNFPCVPFLSIK 
SA1 YQLI SGAYPPLWMQPS SSGEI SKVDIRRPPLEKSSEETLLYL1 KKLENRQFHTGI SCEHESRWRPYVVLFSRNDPAS 
EVLAGVIHSYFSWKQDRDHRSPTLYVSDGAVPYAPKLLGLGGHTVIANGITEIPDGDGLGEFYGYKNSLKVSSLSNGIQF 
MGKHVSLK. 

DNA" sequence: (SEQ ID NO: 42) 

ATG AA CG CTCTG AC C C AACCGG C CGCCCTCGCCG CCTCC CACCTG AACAT CAACCTGACCG ACTTCAT CG ACG AGTTCGG 
CGACGAGCTCCTGGAGTCGCTCAATCGCTCCAACCCCCCGGTCTATACCGGCTCCGTCAACGCTCACCGCCAGTTGGTGA 
TGGACCGACTCAAGCGCAAGCCCTTCGCGGCCCAGGCCGAGGTCGTCCAGGCCATCACCGCCCTGCTGCTGGACCGTAAC 
GAGCAGGCCGGAATCATCAACGCCGAGATGGGCACCGGGAAAACCATGATGGCCATCGCTGTCGCAGCGGTCATGCACGC 
CGCCGGCTATCGCCGGACCCTGGTCGTCTCTCCGCCGCACCTGGTCTACAAGTGGCGCCGCGAGATCCTGGAGACCATCC 
C AGCCG CCCG CGTCTGG GTACTC AATGG C C CAG AT ACTCTACTC AAGCTG CTCAAGCTGCG AGATC AG ATGGGCG ACGCC 
TACGACGGGCGCCAGGAGTTCTTCATCCTCGGCCGCGTGCGGATGCGGATGGGTTTCCACTGGCGGCTCGCCTGCTGGAA 
GAAACGCGCCGCCGGCGGCCAACTGCTCGCTGCGTGCCCGGATTGCGGACAGGTCCTCGAGGACCTGGAAGGCAACCTGG 
TCACGGTGGAGGAGTTCGAGCGTGGTGACCGTCGACGTACCTGTTCCTCCTGCCGTGGGGCGCTCTGGACGCTGATCCGA 
CCAGGCAAGCCCGACGGCGGCAACCGGCGCGCAACGATTCTCAAGTCGATGTGCCGGATACCAACCATCGGCCCGGTCAG 
GGCGG AG CGC CTG CTG AACG ACTTCGGCG AGG ACTTCCTGG C CACG ATGTTGGTGG ACAACGTCT CGG AGTTC ATCAACC 
TGATGGACGCCAAGGGCAACTTCGTCTTCAGCGATCGGCAGGCCAAACGCATGGAGCGATCGATGGCAAACATCGAGTTC 
GGCTTCGGTGAAGGCGGCTACCAACCGACCGAGTTCATCAAGCGCTACCTACCTGATGGCTACTTCGACCTGCTGGTGCT 
GGACGAGGGACATGAGTACAAGAACAGCGGCTCGGCCCAGGGCCAGGCCATGGGCGTTCTCGCAGCCAAGGCACGGAAAA 
CCGTGCTGCTGACCGGAACGCTCATGGGCGGCTACGCCGACGATCTGTTCTATCTCCTGTTCCGCATCCTCACCCAGCGC 
ATGATCGAGGACGGCTATCGGCCCAACGCGCGCGGCAGCATGGCTCCCGCAGCCATGTCGTTCATGCGCGACCACGGTGT 
GCTCAAGGATATCTACACCGAGCGCGACGGTGATTCGCACAAGACAGCGCGGGGCAAGAAGCTCTCGGTACGCACGGTGA 
AGGCTCCCGGCTTCGGCCCAAAGGGCATCCACCGCTTCGTATTGCCGTTCACCGTGTTCCTGAAGCTCAAGGATATTGGT 
GGCAACGTACTGCCCGACTACCAGGAGGAGTTCATCGACGTGCCCATGGCGCCTGAGCAGGCTTCGGCCTATCAGCGCCT 
GGCGGCCACGCTGACAGCGGAGCTCCGCCAGGCTCTGGCGCGACGAGATACCACGCTCCTGGGCGTGGTCCTCAACGTGC 
TGCTGGCTTGGCCGGACTGCTGTTTCCGACCGGAGATCGTCAAGCATCCGCGAACCCGGGACACACTGGCCTTCGTGCCA 
GCGATCTTCGGTGACGAGCAGTTGATACCCAAGGAGCAGGTGCTGGTGGACCTCTGCTTCGAGGAGAAAGCGAAGGGCCG 
CAAGGTTCTGGCATACACCGTCTACAGCGGGACGCGCGACACCACGTCCAGGCTGAAGAAAGTGCTCGAGCAATCCGGGC 
TGAAGGTGGCAGTGCTACGTGCTTCGGTCGATACCGCTCGACGCGAGGATTGGATCCTCGACCAGGTCGATCGCGGCATC 
GATGTGCTGATCACCAACCCGGAGCTGGTGAAGACCGGGCTGGACTTGCTCGACTTCCCGACCATCGCGTTCCTGCAAAC 
GGGCTACAACGTGTACACCCTGCAGCAGGCCGCGCGGCGGTCGTGGCGGATCGGGCAGAAGCACCCGGTGAGGGTGGTGT 
TCTTCGG CT ACG CCGG C AG CTCGCAG ATC ACCTGCTT AC AG CTG ATGGCCAAG AAGATCG CTGTGG CTC AG AG CACGTCG 
GGAGACGTTCCCGAGTCAGGTCTCGACTCGTTGAACCAGGATGGGGATTCTGTGGAGATGGCGTTGGCACGACAACTCAT 
CGCAGCATGA 
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Protein sequence: (SEQ ID NO: 164) 
MNALTQPAALAASHLNINLTDFIDEFGDELLESLN^ 

EQAG 1 1 NAEMGTG KTMMAI AVAAVMHAAG YRRTLVVS P PHLVYKWRRE I LET I PAARVWVLNG FDTLLKLLKLRDQMGDA 
YDG RQE FF I LG R VRMRMG FHWRLACWKKRAAGGQLLAAC PDCGQ VLEDLEGNL VTVE E F E RG D RRRTCS S CRGALWTL I R 
PGKPDGGNRRATI LKSMCRI PTI GPVRAERLLNDFGEDFLATMLWNVSEFINLMDAKGNF^FSDRQAKRMERSMANI EF 
G FG EGG YQP T E F I KR YL PDG Y FDLLVLD EGH E YKN SG S AQGQ AMG VLAAKARKTVLLTG TLMGG YADDLF YLLFR I LTQR 
MIEDGYRPNARGS^PAAMSFMRDHGVLKDIYTERDGDSHCT 
GNVLPDYQEEFIDVPMAPEQASAYQRIJATLTAELRQAI^RDTTLLGVVI^^ 

AIFGDEQLIPKEQVLVDLCFEEKAKGRKV1AYTVYSGTRDTTSRLKKVLEQSGLKVAVLRASVDTARREDWILDQVDRGI 
DVLITNPELVKTGLDLLDFPTIAFLQTGYNVYTLQQAARRSWRIGQKHPVRWFFGYAGSSQITCLQLMAKKIAVAQSTS 
GDVPESGLDSLNQDGDSVEMALARQLIAA 

RL064 

DNA sequence: (SEQ ID NO: 43) 

ATGGCCCTCATG TTC CCG CGCTTGG CG CG C AACTTTGC ACGC AACGG CTACTTCCCT ACCG ATG AGGTC 
ACCCTCGAACGCGCTCTGCAGGCCCTCACTCTTGCCCCGTCGGGAAGGATGAGGATCTGTGACCCCTGCGCCGGTGAGGG 
TGTTGCCCTGGCTGAGGCAGCACACACCCTCGGCCGCGATCAGGTCCAAGCCCTCGCTGTCGAGTACGACCGCGAGCGCG 
~ CGGACCATGCCCGAGGATTGCTTGAGGGAGTGCTGGACAGTGACCTTTTCGACAGGATGATCAGCAGGCAGTCGTTCGGA 
CTG CTCTGGCTC AAC CCG CCTTATGG CG AC CTGGTGGCGG ACC ACTCCGGTG CGTCG C AGT ACC AGGGC AG CGGCCGCCG 
GCGTCTGGAGAAAGCGTTCTACCAGCGCTGCCTGCCGTTGCTGCAGTACGGCGGCGTCATGGTTCTGATTGTTCCTCACT 
ACGTCTTGGACGATGAGCTGACTGGCTGGTTGAGCAACCACTTCACCGGCCTGCGCATCTACGCAGCCGCGGATCCTACC 
TTCAAGCAGGTGGTGATCTTCGGCATCCGGGTCCGTCGGCAGGACCTGGCCCGGGCGGACGCCAATCAGGTGAGGTCTCG 
CCTGCAGGCGATCGGAGCGGGCCAGGAAAAGGCCGAGGAAATTCCAGCGGCTTGGCCGTGGGAACCCTACGTGGTTCTGC 
CGGCCACCAGCGAGCTGGAGCACTTCTACCGAGTAACCCTGGAGCCGGAGCAGTTCGCCGGTGAAATCCAGCGGCTGCGA 
GGTCTCTGGCCTGACTTCAACCTGCACTTCGCGCAAGCGGGGCTGCAGCCGCGCCCACCAGTCCGCGAGCTGTCTCGCTG 
GCACCTGGCCCTGGCCCTGGCCGCCGGCGCGATATCTGGCGTCGTGCGATCGAAGTCGGGCCGGATCCTGGTCGTGAAGG 
GTG AC AC CT AC AAGG AC AAGG TC CG CAAG ACCG AATTC ACCG AGGACGACGACGG C AAC ATC ACCG AGGTGAGGATCCTC 
ACCGACCGTTTCATCCCGATCATCCGGGCATGGGAAATGACACCCTCCTCGGTCAATCAGGGCCGCG^GCTGACCATCAG 
CTCCTCGGCCGCGACCACGGAAGAAGCTGAAGAGCCCCAACCTGAGCCGGCCCCCGCACCCGCACCGCTGCTGATCAGCC 
CTGGCCGGGTCGTAATGACCGCAGCCGTGAGCCACCTGGTGGAAACCGGTCAACTCAACCCAGCGCCTTTGCTGAAACGC 
CATCTGGCGGGAGATTGGGGAACGCTGGACCAGGAAGACTGGAACACCAACCAGAGAGCCCTGAAGTTCGGCGATCGGCT 

GCTGTCGTCCTACGACATCGACGCCGGCGACGAATCCAGGCTCTGGATCATCACTGAGGCAGACCGCAGCTCAACCACGC 
TTTTGCTCCCTAGCGATTACTGA 

Protein sequence: (SEQ ID NO: 165) 
MALMFPRIARNFARNGYFPTDEVTLERALQALTI^PSGRMRIC^ 

RGLLDRVLHSDLFDTMISRQSFGLLWLNPPYGDLVADHSGASQYQGSGRRRLEKAFYQRCLPLLQYGGVMVLIVPHYVLD 

DELTGWLSNHFTGLRIYAAADPTFKQWIFGIRVRRQDLARADANQVRSRLQAIGAGQEKAEEIPAAWPWEPYWLPATS 

ELEHFYRVTLEPEQFAGEIQRLRGLWPDFNLHFAQAGLQPRPPWELSRVmiAIAlAAGAISGVVRSKSGRILVVKGDTY 

KDKVRKTEFTEDDDGNITEVRILTDRFIPIIRAWEMTPSSVNQGRVLTISSSAATTEEAEEPQPEPAPAPAPLLISPGRV 

VMTAAVSHLVETGQLNPAPLLKRHLAGDWGTLDQEDWNTNQRALKFGDRLLSSYDIDAGDESRLWIITEADRSSTTLLLP 
SDY. 

RL065 

DNA sequence: (SEQ ID NO: 44) 

ATGCCCAGTCCCACCCCGCTCTACCAGATCGAAGAGTGTCCAGACCTGTACGTCGACGCCTGCGTGTGCGACGAGCAGTG 
CAACCTGGTCTTTCTTTCGGCCTGGGGCCGCGACACCGTGACACAAGAGTTCCTGGCCAGGCTGACGCTGGGCCGGGAAG 
AAAATGGCATCGACCATTTCCACATCATCGTGGACGGCCGCCGCTTACCTGTCTTCCCAAACCAGGATCTCCTGGAGAAA 
CGCACCACCCGTCAGTTCCGCGGCACGTTGTTCGGCAGCCTGCTCAATCTTTGGCTGTTCGATCGGCGCGCCTCGGCGCC 
CGACCGAGGCAATCACCTCGCCTTCGCACTCCTGCAGCGCGATGAGGATCCACACCAGAGGCTCTGGCCGCTGGTGATGG 
AAACCTGTCCGCTCCCCCTCCTGCAGCACTGGCGCGAGCCGGTGATGGAGGTTCTCACCCAGCACCAGATGTTGACGGCC 
CTACCCGGGACGATCGGCAACGTCTGCGCCTGGCGACTCGCCCTGCGGGTCGACGTGCTTGAGCCCACCCTCGGTGAGGT 
AAT CCG CG AAAG C ATTCTT AC CACCG ATG CTCAGGCG C AAGCCTGA 

Protein sequence: (SEQ ID NO: 166) 

MPSPTPLYQIEECPDLYVDACVCDEQCNLVFLSAWGRDTVTQEFLARLTLGREENGIDHFHIIVDGRRLPVFPNQDLLEK 
RTTRQFRGTLFGSLLNLWLFDRRASAPDRGNHLAFALLQRDEDPHQRLWPLVMETCPLPLLQHWREPVMEVLTQHQMLTA 
LPGTI GNVCAWRLALRVDVLEPTLGEVI RE S I LTTD AQAQA 
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RL066 

DNA sequence : (SEQ ID NO: 45) 

ATGAATCCATTGTTCACCAACCTCACCCAGGAAACCCTCGCCTACCTCGAGGACCAACTGTCCAACAACGACGTCGCCGG 
CGACGACGAGCTCATCGACTTGTTCATCGAGGAGCTGTCGCTGACCTTGGAGCAGGCGGAAGCGGCTGTCGCGCTACGCG 
ATCAGTACCTCTGCCAGGTCTTCCTGATCGGCCAAGGGCCGCTGCACCAAGCCGATGGACTCAGCTTCGACCCTCACACC 
AAGAGCGTTCGGTAG 

Protein sequence: (SEQ ID NO: 167) 

MNPLFTNLTQETLAYLEDQLSNNDVAGDDELIDLFIEELSLTLEQAEAAVALRDQYLCQVFLIGQGPLHQADGLSFDPHT 
KSVR 



RL067 

DNA sequence: (SEQ ID NO: 46) 

ATGGGATGGCTTTTCTCACATCAGACGAAGGAAGACCTGCTGCGTGAGCTGCTGGCCCCAACCAGTACCTTCGCAGGCAG 
C AC CG AGGTGCTGG C AC ACGCAG T CTCCGG C AATG AACTTTGG AC TGTCGTAAAACG AACTTTTCACCTTGCCGG ATTCT 
ATTTCGGC)U^GCCGGCCGGTCACTCGATCACCATGA 

~M^ij^&!T^^tSStji^P^CGCCGG C C CGTTCT ACT ACGG CTGTC CG CTGGAGTTC CTGGACCTGGCTC ACG ATGAG ATCAA 
CCAGGAATGGCGTAAACGCCTGACGCACGAACACCAAGCCTGA 

Protein sequence: (SEQ ID NO: 168) 

MGWLFSHQTKEDLLRELI^PTSTFAGSTEVIAHAVSGNELWTWKRTFHIAGFYFGKPAGHSITMIEIJILLDCSAGQWGY 
KTIPESAGPFYYGCPLEFLDIAHDEINQEWRKRLTHEHQA 

JU/06fe 

DNA sequence: (SEQ ID NO: 47) 

ATGAAATCGATCTACAACACCCCAGGCTTCAGCGAGGAGTTGTTGCTGGTTTGCGCCTCGCTGCGCGAGGTCGGACTGGA 
CAATCTGGCTGACCAGTTCCGCGCGGCAGTGTTCGACCGATCCGTCGTCGACCAGGCCATCATCGCACTGCGTGAGCGGG 
TGAAGACCCCTTCGCCGGAGCATGCGGCCGACAACGAGCCCTGGTTGTACTGCGACTGGCAGGCCAGGCAAACAGCTTAC 
CGGCTCCTCCAGCGCCTTGAGCGCGCAACACGCTGA 

Protein sequence: (SEQ ID NO: 169) 

MKSIYNTPGFSEELLLVCASLREVGLDNI^QFRAAVFDRSVVDQAIIALRERVKTPSPEHAADNEPWLYCDWQARQTAY 
RLLQRLERATR 

DNA sequence: (SEQ ID NO: 4 8) 

ATCCCCTCACCACGATCCCGCTTCGGCGGGATCATCCTTTTCGCAGGTCATACCATGATCACAGTTCCCGGACAGTTGGC 
CATTCGAACCATCAACGGTCGCTATGGCGAGTTCAATGTGGGAAAACTCTGGACTTCGATCGGGGAGTTCATCATCAAGG 
ATGCCTTCCTGGATCAACACACCGAAGGCAAGTACCGCGGTGATTTCGTCATCGCCAATATCCGCCCCCACCACTACTCC 
GCCGGCGGTCGGCTAGTCATCGAGATCCGCGCCATAGTGGACAGCATGACGCTGAACGATATGGACAGCCTCAGCGACGA 
GGAGGTAGAGCGTCTTTCCGGCAATGAGGTGGATCCGCTCGACGAAGTGCCCGAGATCCAGCTCCCCACAGTAGTACCGG 
CGATACCACCAAAGTCGCCGTCACCCCAGAAGTCGAAGCCTCTGTGCCTCGCTGCAACCAGGGACGCGCCTTTCGGTATG 
GACACTCCGGCTCCTGCAGAGCAGGCCGCCTCTCTGGACACAGACGCGGATGCAGAACTGTTCGGGACGGTCTGGCCGCT 
AGGCGAAATCGTCAAGCTGGACACCACGGTCGACCGCAAGCGACTACGCCAACAGTGCGTGCGACTCGGCGCGCTGGGCT 
ATGAGCTCGACTTCAAACAACAGGTGTGGACCCGCAAGGAGGCCGCATGA 



Protein sequence: (SEQ ID NO: 170) 

IPSPRSRFGGIILFAGHTMITVPGQLAIRTINGRYGEFNVGKLV9TSIGEFIIKDAFLDQHTEGKYRGDFVIANIRPHHYS 
AGGRLVIEIRAIVDSMTLNDMDSLSDEEVERLSGNEVDPLDEVPEIQLPTWPAIP 



RL070 

DNA sequence: (SEQ ID NO: 49) 

ATGACCTCTCTCAACAACCACTCCAGCGCAGGTCACACTGCTGCGTACCTCAAACTCCCGATCGTTCTCACCAACGCGGC 
CTGGCTGCGCCTGGTCTATCTCGCCAACCCTGCCAGGGTCGACGAGATGGGCACCCGGCTGGCCAGTGTCGTTCAAACCG 
CCTGGCAGGAGCTTTCTCTCCAGCCGACCGCGAAGCACATCCAATTCCACCTGTACCACAAGGAGGAAGAGGGGCAGGAC 
CGCGCGCTCGCGCTGCTGGTTCTCTCGATAGTCGAGCCGTCCGATGAGCCTTCCTACCTGCGCATCGAGTTGCAGGAAGA 
GTGCCTCGCCGAACACCCGGTTACCGAGTAG 

Protein sequence: (SEQ ID NO: 171) 
MTSLNNHSSAGHTAAYLKLPIVLTNAAWLRLVYL^ 
RALALLVLSI VEPSDEPSYLRI ELQEECLAEHPVTE . 
PKSPSPQKSKPLCLAATRDAPFGM 

DT P AP AE QAAS LDTD AD AE LFGT VW PLGE I VKLDTTVDRKRLRQQCVRLG ALG YE LD FKQQ VWTRKEAA 
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Rb071 

DNA sequence: (SEQ ID NO: 50) 

ATGACTCAACTCAACCCGTTTATTCGCGGCTATGAGAGTTTCCGCATCGAGCGAAACCTGCAGATCACTGACGAAGGCAA 
CAATCTACCGTGCTACCGCGCTCTGCATGAAACCCAGCAGCACCTCCCAGACGAATATTTTCAGTGCGAGCTGTGCTACT 
TCAATAACGATTTCGCCGTGGTAGTCCAAGAGTTAGACGATGAAAGAGTTGAAAAATGCCCTCACCAAGGAATAGTGAGA 
AACGTACTTTACAGCATCTACGGTGAGCAGGACGGCAGAAAAAAGCTTATCGGAGATCAATACTCACTGACCGAAGCCGA 
GAGTGTCGTTCGATACCTTTCGTTCGGCGGCGGTTATAACCCCTGCTGGGAGATCAGAAAAACACATCTACCCATCAGCG 
CGTGGAATAGCCTCTACGAAAGGTTCTCGACCAAGATGCCAATCCGCTTGCCCTCGGTGTTGGTATCGCTCTTCTGGTGT 
AACG AG CACGG TG CCG TGGG CTTT CG CTTGC AC AACACCC CTTGG ACGG ATG AGTGTCTGG AGATC CTGG AG ATG ACCGC 
AGCCGCTCTTCGACAAGAACAGCTTGCCTTCGGCCTCGACGAACACCTTGTCGATCTGCTTCACCTCGCGGGACAAGCAG 
ACATTCGG CTC CTGGT ACTTG ATC C ATTCGCG CC C ACGCTC AAGGG CCTGCCGCTTTATG ACG ATTG A 



Protein sequence (SEQ ID NO: 172) 

MTQLNPFIRGYESFRIERNLQITDEGNNLPCYRALHETQQHLPDEYFQCELCYFNNDFAVVVQELDDERVEKCPHQGIVR 
NVLYSIYGEQDGRKKLIGDQYSLTEAESVVRYLSFGGGYNPCWEIRKTHLPISAWNSLYERFSTKMPIRLPSVLVSLFWC 
NEHGAVGFRLHNTPWTDECLEILEMTAAALRQEQ1AFGLDEHLVDLLH1AGQADIRLLVLDPFAPTLKGLPLYDD. 




DNA sequence: (SEQ ID NO: 51) 

ATGGGACTGGTGTTTCCTACCGAAAGGAGAATCACCATGCAATACGGAAAGCTGGCGCTCGCCCATCTCAGCCTGGAACT 
GCCGTTGCAGGTACTTATGAATAAGAACCGTGCTTACTACATCGGCACTTCTGACGAAGAAGGACCAGCCTCGCGCGAGT 
CGGTTGAATATTACCCCTCACGCGAACTTGCCCAACAGGCATTAGACCACGGCACTTGGACGCAACTGGAATATTAA 



Protein sequence: (SEQ ID NO: 173) 

MGLVFPTERRITMQYGKIAIAHLSLELPLQVLMNKNRAYYIGTSDEEGPASRESVEYYPSRELAQQALDHGTVJTQLEY. 



DNA sequence: (SEQ ID NO: 52) 

ATGGGAAATGTTTGGCGATTATGCCAGGGCAGATACCTGGGCATTGTTGTTGGCCAGGAACAGCCAGGCGAAGTTGCAGA 
ACTGACTGCTGAGCAGCAGCTCGTCCTCGACGTCGCTGAGGCTAACCTCCTCAACTTCCGGCAGGGCGGGCAGTTCTACG 
ATTTGGATGTTGCTCATGATGATCTCCAGATAATGGAGAACACCACGCCCTGGGGGGAGATGGTGCCCCCCGGATGGGTA 
TG CGATG AAG AGTG G CG CATAGCG TAG 

Protein sequence: (SEQ ID NO: 174) 

MGNWRLCQGRYIX3IVVGQEQPGEVAELTAEQQLVLDVAEANLLNFRQGGQFYDLDVAHDDLQIMENTTPWGEMVPPGWV 
CDEEWRIA . 

Ri§7* 

DNA sequence: (SEQ ID NO: 53) 

CTGACGGGCAAGGTGTTTCTCCGCTTTCGCTTACGAAACTGGAGAATCATCATGAGCAACAACACCCAAGCCCAAGAAGC 
CAAGTATTTCGACCTGCACACCACCGGTATCGGCTACCTCAATCGCATCCGCGAGGTACCGATCCGCCGAGGTGAACCAT 
TCCTCGCCGTAACCGTCGCAGCCCTCCATGGCGCGGCAGACAGCGTGGAATACTCCTACATCGACTGCAAAGTGGTCGGC 
GCCCAGGCTGAAAAGCTTGTCCGCCGTTGCAAGGAAGCAGTCGAGGCCAAGAAGAAGGTTCTGATTTCCTTCCGTATCGG 
CGATATCTGGGCGGATCCCTTCATCCACCAGAAAGGCGAGAAACAAGGCAAGCCCGACGCAAGCCTCAAAGGCCGGCTGC 
TCTTCATCTCCTGGATCAAAGTGGATGGCACCACCGTCTACGATGCGAAGGAAGAAGCTGAAAAAGCCCAGCAAGGCAAA 
GGCGAACCTCAAGGTGAGCCCGCAGCCCCCGCTGAGCACGCTGAACAAGCCGCTGCTTGA 



Protein sequence: (SEQ ID NO: 175) 
LTGKVFLRFRLRIWRIIMSNNTQAQEAKYFDLHTTGIGYI^^ 

AQAEKLVRRCKEAVEAKKKVLISFRIGDIWADPFIHQKGEKQGKPDASLKGRLLFISWIKVDGTTVYDAKEEAEKAQQGK 
GEPQGEPAAPAEHAEQAAA . 

DNA sequence: (SEQ ID NO: 54) 

ATGTCCAAGCAATCCACCAGCTTCGAAATCGGCTTTGCCCTCGGCAGTGTTGTGCGTGAGTTCCGCAGAGCGCTCAGTCG 
CCCTCCGGTCGTAGTGCAAGCACAAGCGCCGGTTGCGTTGAGAGTCCAGCGCATCGATCCTGCCTTCCTGGCCGGCCCGA 
CCGCTGG CG AG CT AG AAC AC ATCAG CG AC ATC CC AGCCATCGTC CGG CTG AAG AAGGTCAACCTG AATG ACTGGTATCTA 
GCCAATACGCGCGAGGTGCAAAAGCCCAAGCGCGCACGCAAACCCAAGCCGGCCAAGGCGACCGCCAAAGCTGAAACGCC 
AGTCAGGAAGGAGCTCAAGATGGGTTCCCTCGACCATTTGATTGCACCCAACTCCGAAAGCGAAATGGGGAGGCCCCCTC 
TCCAGTTAGAGTCCCTGAACGATCATGAGATTGCTCTTTTGCCAGCACCTCCTGGTAGCGCAGTCTCTTGGGAACTCCAT 
CGGCGTACTCAGGAGCAATACCAACAACGCTGGCAGGACTACTTGTCCACCATGACGGATGAACAAGTAGCTGCTCTCGG 
CCGCTAA 
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Protein sequence: (SEQ ID NO: 176) 

MS KQSTS FE I G FALG S WRE FRRALSRPPVWQAQAPVALRVQRI D PAFLAG PTAGELEH I SD I PAI VRLKKVNLNDWYL 
ANTREVQKPKRARKPKPAKATAKAETPVRKELKMGSLDHLIAPNSESEMGRPPLQLESLNDHEIALLPAPPGSAVSWELH 
RRTQEQYQQRWQDYLSTMTDEQVAALGR . 

RL076 

DNA sequence: (SEQ ID NO: 55) 

ATGGTGTTTCTCCTGCAGGTTGAGGGTGCGGAGAAAACACTGGCCCTGGCGGGGAAGTGGATTCCCCGCTGGGTTGCGGA 
AGGG AGCTTCT ATCG ACCG AGG CCG AC CG ACCG CGCT ACCAGAAG CT ATGCGGTCTTGGGTTGGAT CAACACGGTGGGCT 
GTGCTGCAGCATTTCGGATCCGAGCTGCATGGGGGCATGTCGCTGACAACGTCAGCAGATCACGCGTTCATCATCGAAGC 
GGGGGGCGAAAGTGTCAAGGTCAAGCAGGAGGGGGAGCGGATGCAGCGGGCGGAGAGCGAGGGCGGAAGAGCGCGGCTGG 
TAGAAACCCTGTCAAAGGTTTCCCCAGCCGTGTCTGGAAGGGGAGTCAAGTGAGCCACCTGTGGTTGAATCGTCGATCCC 
TGGGCATTGATCGTCTCGATCCCATCACCCGGCCATTATCGTGGCTTGGCCAGCAAACAGTAGGCACGCATCCGCGTACA 
AAGGGAGCCCTGCGTATCACCGGCGGGCCACCGGCAGGGAGAAGGATCCCGATGGGTAGCCTGATAGTCCTGGAGCAGGA 
GCATCAGGCTACCCATGGAGAGGGGAAAAGGAGGGGCCGTAACACCAGTACGACCCTTAAATCGAGGAAACACCGAACCT 
CTTGA 

Protein sequence: (SEQ ID NO: 177) ._ . 

-MVFLLQVEG AEKTLALiAG KW I-PRWVAEGS FYRPRPTDRAT-RS YAVLGWINT-VGCAAAFRI-RAAWGHVADNVSRS RVHHRS 
GGRKCQGQAX5GGADAAGGERGRKSAAGRNPVKGFPSRWKGSQVSHLWLNRRSLGIDRLDPI^ 
KGALRITGGPPAGRRI PMGSLIVLEQEHQATHGEGKRRGRNTSTTLKSRKHRTS . 



Ri07.7 

DNA sequence: (SEQ ID NO: 56) 

ATGCCGCTGATGTGGATCGTCCTGGTGCTCGCGCTCATCACCGGGACCTGGCTGAGTGTACAAAGCGACCACGCGACCTC 
GAGCGCCGAACTGGCCGAGGTCGACACCCTGGCCAGGAGCTTGCTGCTCTTCCGGTCCAGTCTGGCGGAGTACGCACACG 
CCAACCCCGGTTTCACCGGTTCGCCGGCGGACTCCGCTCTTGGTTTACCGGCCTGGTTCCGCAAGCCAGCGCGGCTTCAG 
GGCTACATCGCCGCCGGCACCAGCTACGCCTTCATCGCCTCGCCGCCGGCGGGGCTGGCGGCGGCCGTGGATGCTGGTAC 
GGAATCCGACCTGGTTGGCGTCAGGCGCAACGGCCAGTTAGTCACGCGCCGCCTCGGAGCCACTGTCATTGCGCTCCCTA 
CGCCCATCCCCGAGGGCGCGGTGGTCGCGGTCAAATAA 

Protein sequence: (SEQ ID NO: 178) 

M PLMW I VLVLAL I TGTWLS VQS DHATS S AE LAE VDTLAR S LLLFR S SLAE YAHANPG FTG S PAD S ALG L P AWFRK P ARLQ 
GYIAAGTSYAFIASPPAGLAAAVDAGTESDLVGVRRNGQLVTRRLGATVIALPTPIPEGAWAVK. 

RL078 

DNA sequence: (SEQ ID NO: 57) 

ATGAGGAGTACGCGCAGCAGTGGATTCATCTCGATCGAACTGATGATCGCCCTCGTCGTGATCGCCATCGCGACCGCCGG 
TGGCATATCGGTCCTGATGAGCTACCTGGACGGCTTGGACGAGCAGCACGCGGCCCAGCAGCAACAGCAGGTGGCCAAGG 
CAGCGGAGAAGTACCTGAAGGACAACTTCAGCACGGTTCTGGCCAGCGCCGGCGCCACGGCCCCGGCGGTGATCACCGTC 
CCGATGCTGCGCAACACCCGTTACCTGCCCGCAGGCTTCCGCGACACCAACATCTACGGCCAGCAATACCAGGTCCTGGC 
CCGCAAGCCGGCGGCCAACCAGCTCGAAACGCTGATCGTGACCACGGGTGGACAGGTAGCTTCCGAACTCTCGATCCGCC 
GGATCGCGCAGCTCATGGGAGCCACCGGGGGCTACATCTCGAAAACCAACACCAGTATCGCCCAGGGCGCCGCCTGGCAG 
GTGGCCTTAAGCAATTTCGGTAGCGCTCCCGGCGCTGGACATCTGGCGACGGCGCTGTTCTTCCAGGACGGCGCCATCGC 
CAACGAGTACCTCTACCGCAATGCCGTCCCGGGTCATCCTGAACTCAACCGGATGAATACCACGCTGGACATGGGAGGCA 
ACAATATCGCCGCAGCCGGGGCGATCACGGCCAGCGGCAACATCACCACCAGCGCGGACATCAGCGCGCGCAACGTGACA 
GCCACTGGTACGGTGAAAGCCGGCACTGCTGACGTCGCCGGCGAGACGTACACCGGAGGCTGGTTCAGGACCCGTGGTGA 
CACGGGCTGGTACAACGAGAAATGGGGCGGCGGCTGGTACATGAGCGACAGCACCTGGGTGCGCTCCTGGATGAACAAGA 
ACGTCTACACCGGCGGCGAGATGAAAGCGGGCAAACTCACCGCCGAGGGCCGGACGGAAGTCGGCGAGTACCTACAGCTC 
AAAGGCGTGGCCACCGAAGGAGCCAACTGCTCGCCGAACGGGCTGGCAGGCATCACCAGCACCGGACTCTGGCTGTCCTG 
CCAAAACGGGAAATGGGGACGAACCGCCGCCTCCATGCGCCTGAACACCACGGCCGGCGTGATCAAGGACTGGTGTACGT 
TGCATGGTCAGGATAGCGCCATGGTG71ACTACGACTACGTCCGCTACGCGATCACCTGCGGCGGCCGATTCTGCGCAGTG 
GGCTTCAACCAGACATTTGGCACCAACTACTCGTTCGGGCTAATCACTGAGATCGGCCCAGGCTTCAACTACCCGGAACC 
CTACAAGACCCCCGACTCGACCAACGTGACCGTTACCTGCGTGAACTAG 

Protein sequence: (SEQ ID NO: 179) 

MRSTRSSGFISIELMIALWIAIATAGGISVLMSYLDGLDEQHAAQQQQQVAKAAEKYLKDNFSTVLASAGATAPAVITV 
PMLRNTRYLPAGFRDTNIYGQQYQVLARKPAANQLETLIVTTGGQVASELSIRRIAQLMGATGGYISKTNTSIAQGAAWQ 
VALSNFGSAPGAGHLATALFFQDGAIANEYLYRNAVPGHPELNRMNTTLDMGGNNIAAAGAITASGNITTSADISARNVT 
ATGTVKAGTADVAGETYTGGWFRTRGDTGWYNEKWGGGWYMSDSTVAmSV^KNVYTGGEMKAGKLTAEGRTEVGEYLQL 
KGVATEG ANC S PNGLAG I T STGLWLSCQNGKWGRTAAS MRLNTT AGVI KDWCTLHGQD S AMVNYDYVR YAI TCGGRFCAV 
GFNQTFGTNYSFGLITEIGPGFNYPEPYKTPDSTNVTVTCVN. 
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RLQ79 

DNA sequence: (SEQ ID NO: 58) 

GTGAGTGTGAACCCGATCATCCAGGCTCAGTTCGTCGACCTCTACCTCGGTGAAGGCTTCGCCGACGTGAAAGGCCTGGC 
CGGCGCCGGCGCGCGCCGAGTCGAAGTGCCTCGCGAGTGGGAGTCGCACGTCCAGGAACTGCTCCAGATCTGCAGGCAAA 
CGCTGGAGGAGCTGCAGGACCCTGAGTTCGCCATCGTCGTCGACGGCGTTCTGCTTCGCGTCACCCTCCTCGAAGACGCT 
TTCAGTGGCAGCGTCTTCGTGCTGCGCCGGTCGAGCGCCCAATTGCGGGAGTTCCAAGAGATCGGCTATCCGAGCGAAGT 
GGTTTCCGCACTGATGGATCCGCAGTTGCAGGGCCTGGTCCTGTTCTGCGGCGAGATGGCGACAGGCAAGACCAGCTCCG 
CCGCCTCTCTGCTCCTGGCCCGCCTGCAGGAGTTGGGCGGGGTGGGCTGCGCCGTCGAGGACCCGCAGGAAACCAACCTC 
AGCGGTCAGCATGGGCTCGGCCGCTGCATCCAGGTCAGAACCTCACGGCGCTCAGGCGGATACAGCGAGGCCCTGCTGCG 
C ACG CTG CGGG CCGG CG C CG ACCTGG TG CTG ATTGG CG AG ATC CGCG ACG AG G ACACCG CCTACCAGGC CTGCAAGG CCT 
CTCTG ACCGG C AGC CTGG TG AT CGC C ACCATT C ACG CG AAAAGCTGTCATC AGGCGATCGAG CGCTTGGTGACGCTCGCC 
CAGCCACTGGCGAGAAACGCCTACGACGTGGTTGCCGAAGGCATCCAAGCTGTGATCTGCCAAGCGCTGGAGAGCGATGG 
TTCCTCGCGCCGCCTGACCGCCGAGCCACTGCTGTTCACTGGCGACGACGGCCCGTCCATGCGCGACAAGATCCGCCGAA 
AGGAGGCTCATCTGCTGCAGGACGACCAAGCTCGCCAGTCCCGGCAAAGCCTATGGAGATAA 



Protein sequence: (SEQ ID NO: 180) 

VSVNPIIQAQFVDLYLGEGFADVKGUVGAGARRVEVPREWESHVQELLQICRQTLEELQDPEFAIVVDGVLLRVTLLEDA 
FSGSVFVLRRSSAQLREFQEIGYPSEWSALMDPQLQGLVLFCGEMATGKTSSAASLLLARLQELGGVGCAVEDPQETNL 
SGQHGLGRCIQVRTSRRSGGYSEALLRTLRAGADLVLIGEIRDEDTAYQACKASLTGSLVIATIHAKSCHQAIERLVTLA 
QPLARNAYDWAEGIQAVICQALESDGSSRRLTAEPLLFTGDDGPSMRDKIRRKEAHLLQDDQARQSRQSLWR. 



DNA sequence: {SEQ ID NO: 59) 

ATGAGCACTACGCAACGCACTTCCCGTCCGACGCAGGGCGGTTTCGTTTCCATCGAGATGATCATCGTGCTGATCATCAT 
CGCCATCGGGGTCGGCCTGGGCCTGGCCGCAGCGGCTGGAATGTTCAGTTCGTCCAACGCCAACGAGGAACAACGCAACA 
TCAGCGTCATTGCGGCCAACGCACGCGCCCTGAAGACCTCTTCGGGCTACGGCTCCAGCGGTACCAACCTGATCCCCAGC 
CTGATCGCAATCAACGGCGTGCCGAAGAACATGAGTGTCTCCTCCGGCGTCGTCTACAACGTCTACGGCGGATCGGTCAC 
TGTCTCGTCCACCGGCATGGGCTTCTCGATCACCACCAGCAAGTTGCCCCAGGACGCCTGTATCACGCTGGCCACCAAGA 
T CG CG AAG AACACCTTCG AG C AG AC CAAAATCAAC AG CGG ATC CTCG ATC ACCGG AG AAGTG ACCACCGCAG CCGCG ACC 
CAGG CCTG CAGCAG CG ACAG CAACAG CATTAC CTGG ACCTATAGTTCGTGA 

Protein sequence: (SEQ ID NO: 181) 

MSTTQRTSRPTQGGFVSIEMIIVLIIIAIGVGLGLAAAAGMFSSSNANEEQRNISVIAANARALKTSSGYGSSGTNLIPS 
L I AI NG V P KNMS VS S G WYNVYGG S VTVS STGMGFSITTS KLPQD AC I TLAT K I AKNT F EQTKI N S G S S I TG E VTT AAAT 
QACS SDSNS I T WTYS S 



RL081 

DNA sequence: (SEQ ID NO: 60) 

ATGGGGGGCTTCTGGGAGCAGTTGCAGTTCGCCTTCTACAGCAAGCAGTTCGGCCGCAAGGAACGCCTGCAGTTCTACGA 
AAGCATGTCCACCCTGCTCGAAAACGGGGTCCCGTTGAAGGATGCTGTGGCAGAGGTGCACAAGATCTTCGCTCATGAGG 
GGCAGCATCCGTTTCATCCGGTGGCCATCGCCAGTCGCGAAGCGCTGATGGGGCTGTCCAACGGCAAGCGTCTGGCCACC 
GCCATGGCGCTCTATCTCCCCGCCCAGGAGCGAGCGTTGATCGAGGCCGGCGAGATGAGCGGCAACCTGGTTCAGGCCAT 
GGGCGATGCCGTCTCCCTGGTCGAGGCCCAGGCCAGGATCCGCGCCACCATCTGGCAGGCGCTGCTCTACCCCTCGGCGC 
TGTCCGCCATGATGGTGTTCCTGCTGTGCATCGTGGCCTATCGCATGGTCCCCAGCCTGGCCAGGCTCTCCGACCCAGTC 
ACCTGGACCGGCCCGCTCGCCACGCTCAACGCCATTGCCAGCTTCGTCACAGGACCTGGTATCTACGTTCTGGTCGCCGT 
CATCACCCTCACGGTGGTGGTCATCGTCACGTTGCCGACCTACCGCTGGAAAGGCCGGGTCTGGCTGGACCGGACGCTGC 
CGCCCTGGTCCATCTACCGCATGCTCCAGGGCACCACCTTCCTGCTGAACATGGCGGTCATGCTCAACGCCGGCATACGC 
CCCTACGACAGCCTGGCCAGCATGATCAAGATCTCCCCGCCCTGGCTGAAGCAGCGCTTGGAAGCTGCCCGCTACGGCGT 
GGGCCTGGGCCAGAACTTGGGTGTTGCCCTTCGCAGCGCCGGTCACGATTTCCCCGACCGGCAGGCCATCCAGTACCTGT 
G CATC CTCGCC AACCGGGG AGGCTTCTCCG AGG CGCTGGT CAAGTTCAGCCG CCG CTGGCAGG AG ACCAGCCTCAAGCAG 
ATCGAGCTGGCCGCCGGGCTGGTGAAGAACTTCGCCCTGATCTTCATCGGCGCGCTGATGATCCTGGTCCTGCTCGGCGC 
CTACCAGGCACAGCAGCTCATCCAATCCATGAACCACTGA 

Protein sequence: (SEQ ID NO: 182) 

MGGFWEQLQFAFYSKQFGRKERLQFYESMSTLLENGVPLKDAVAEVHKIFAHEGQHPFHPVAIASREALMGLSNGKRLAT 
AMALYLPAQERALI EAGEMSGNLVQAMGDAVSLVEAQARIRATIWQALLYPSALSAMIWFLLCIVAYRMVPSLARLSDPV 
TWTGPIJVTLNAIASFVTGPGIYVLVAVITLTVWIVTLPT^ 

' PYDSLASMI KI S PPWLKQRLEAARYGVGLGQNLGVALRSAGHDFPDRQAI QYLCI LANRGGFSEALVKFSRRWQETS LKQ 

__I ELAAGLVKNFALI FIGALMI LVLLGAYQAQQLI QSMNH . 
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RL082 

DNA sequence: (SEQ ID NO: 61) 

ATGACGAACCTTCAGATTGCCGCGCTTGCGCAGCCCTCCATGGTGACCCAACTGCTCACCGCCGACGGTGGTGAATGGGA 
GGTATCGAAGCACCTGCAGGAAATCATGGCTCTGGCTGCCGACGGCACGCTCTATCTATCGGAGAGCCACCAGAACGACA 
TACACGTTCTGTCGTTCATCGACCGTCTCGATCGCCGTGGCTTCCGATACCAGCTCAACCTCACCGACCTGCAGACCATT 
CACCAGCTTTACCGCGCCGTCGCCATGGACGGCCTGGTCGATAGCGATGGCCAGCGCGCCACCCAGATGCAGGAGCGCGT 
GGTCAAGATCATTCGTAAGGCCACTGAGCTGCGCGCCAGTGACGTGCATTTCGTCGTGAGTCCCGCCGGCACCGGCAGCA 
AGATCCGTTTCCGCGTCGACGGCCTGCTGAAGACCGTCGAGCAGTTCCGCAGCCAGGAGCTGCACGAACTCTGTGCAACC 
ATCTACCAATCCATGTGCGACGTGGCCGAGCCACTGTTCAAGCCGCAACTGGACCAGGACGCGCGGATGAGCCAGACCTT 
CGTCGAGAAGCTCAACCTGTTCAGTGCCCGGATCGCCACCCGCCCGCGTGCCGGGGGGTTCCTGATGATCCTGCGACTGC 
T CTACG ACG ACACCGG CCTCG AC AG CCTG G AG C AGCTCGG CTAC CTG CC CG AG C AG AACGCACTGTTCG ATCG CATGATG 
CGTATGCCCTACGGCATCAACATCCTGTCCGGCCCCACCGGGTCAGGAAAGTCGATGACCTTGAAGGTCACCCTGGAAGG 
CCiCGACAAGCrCCAi^GCGGATCCAAGCACATCCTGACCATCGAGGATCCGCCGGAATACCGCATTCGCGGCGAAGGCA 
TCAACCAGACCCCACTGGTCTACGACGCCACCGACCCAGACGCAGAACGCCAGGCCTGGGCCGCGGGCATCGCCAACGGC 
ATGCGCCTGGATCCGGACTACATGATGATCGGCGAAGTACGCGACCTCTTCGCCGCTGTCGCCGCGTTCCGTGGTGCGAT 
GACCGGGCACGGCCTATGGTCGACCCTGCACACCAACAGCGCGATCGGCATTGTCCAGCGCCTGAAGGACCTGGGCGTCG 
ACCCCGGCTTGCTGTTCGATCCGGCCCTGCTGACCGGCCTGATCAACCAGAGCCTGCTGCCCAAGCTCTGCCCCCACTGC 
- AAAGTGCGCTTCGAAGACGAGCAAGACCAACTCGCGCCCGACTTGGTCGAACGGGTCCGACGCTTGACCGATGTTTCCCA 



CCGAGGTGGTTC 

"TGCCCACCCTCGCCTTCATGCGTGTGTTCGCCAAAGGCGGCCCAGCCGAGGCACGCAACTACTGGGTCAAGACCATGCAG 
GGCATCACCAAGCACGCCCACGCCATCCGCCGCATCAACGAGGGCATGTTCGACCCGCAGATGGTCGAGGATTTCATTGG 
GCCACTCGACTTCGATGAGCATCTGCTCGACGACAGCTTCTACTCGCAGGAGGCGTGCTGA : 

Protein sequence: (SEQ ID NO: 183) 
MTNLQIAAIAQPSMVTQLLTADGGEWEVSKHLQEIMAl^ 

HQLYRAVAMDGLVDSDGQRATQMQERWKIIRKATELRASDVHFWSPAGTGSKIRFRVDGLLKTVEQFRSQELHELCAT 
IYQSMCDVAEPLFKPQLDQDARMSQTFVEKI^LFSARIATRPRAGGFLMILRLLYDDTGIJ5SLEQLGYLPEQNALFDRMM 
RMPYGINILSGPTGSGKSMTLKVTLEGLDKLHGGSKHILTIEDPPEYRIRGEGINQTPLVYDATDPDAERQAWAAGIANG 
MRI^PDYI^IGEVRDLFAAVAAFRGAMTGHGLWSTLHTNSAIGIVQRLKDIXSVDPGLLFDPALLTGLINQSLLPKXCPHC 
KVRFQDHQDQIAPDLVERVRRLTDVSQVHVKGPGCQACRGSGWGRSIVAEWLPTIAFMRVFAKGGPAEARNYWKTMO 
GITKHAHAIRRINEGMFDPQMVEDFIGPLDFDEHLLDDSFYSQEAC. 

DNA sequence: (SEQ ID NO: 62) 

ATGCGAACTGAGCCGATCGGCATGGCCGTGGCCGTGCTCTTCCTCCTCGCGTCTGGCCAGGCCTGCGCTGGCACCGTTGG 
CG AACTTGCGG AG AT C C AGGCC C AGGC C ATC CTCAC CG AGGC C AAGGTG CGCCTGG C C ACGG CG C AGCGG C AATTGG AAG 

GCAAAGGCGAAACCGGCCAGGTCGTCAGCGCCCAGGGGCAGACGTTCGCCATGCCGGTGCCGGCGGCGCCGCCGACGATC 
ACGCAGCCGGTTCCGCCAGTGGTGCGGACCATCTACGGCGCCGGCGGCAAGATGACTGCCACGTTCTTGTTCCCGGGCGG 
GTACGAGGTTGACGCCGCCAGCGGCGCGGAGCTGCCTGGCAAATACCGCGTCGAGTCAATCTCGCTGGACCAGGTCGTGC 
TCACCGACAAGGACGGCAACCGCGTGCCCGTGGGCTTCTCCAGCGTTGCCCCCACCCAAGCCTCCTCTACGGCCCAAGGC 
GCCTCGGTTCCGCCGGCGCTGCCCGGGGCTGTACCGCAGCCGTTCATTCAGTAG ~ * ~" 

Protein sequence: (SEQ ID NO: 184) 

MRTEPIGMAVAVLFLIASGQACAGTVGEUIEIQAQAILTEAKVRLATAQRQLEGKGETGQVVSAQGQTFAMPVPAAPPTI 
ASVPPAI^AV^ 

DNA sequence: (SEQ ID NO: 63) 

ATGGAGAAGCCTGACCTCGGCAGCCGTGGACCAGACGTCTCGATCCTGAGCTACCACGGCAACAAGTTCGTCAGCGGCCT 
GTTCTGGCGGCCGCTGTCCAGCCAGCGGCAGTACATGAAGGAAGCGCGCAAGCTGGGCAAGGAAGAGCATCTGGACATCG 
TTGCCATCCGCCATTCACCGACGGTGATCCAGGCCGGCTTCGTTTCGAAGTCGCAAGGCGCAGTCAAGGGGATGTACTCC 
CTGGCCTCGGCGCTTTCAGGCCAGTTCGACGGCGACTTCCTGGCCTGCTGGAAAGTCGACGAGGACCGCTACGCGCTGGT 
CGCCACGCTCGATGGCGCGATTGTCCCCGGGCAGGATCTGGTCACCACCCTCGACGAGGCCCGGGACCGGGTCAGGAAGC 
TCTCTACGCGCGGCGTGCTGCGAAACGCACAGGTCTTCGTTCCCGAAGGGTTCGATTTCCCCGTCAAGGACTTCGACATC 
GAGGAACTGCTCGCGCCGAAGCGCCTGCGGCGCGACTACCGCCTCCGGCAACTCACCTTCGGCTTGTCCGCCAGGGAGTG 
GACGGCAGTGGCCCTGCTCGGTTGCGTGGTAGGTGGGTCGCTAACCGCCTACTACCTATGGAATGCCCACCAGGAAGAGC 
TCG C C AGG C AAG CCG CG CTC CTCG AGG AGCAG AGGCGCCTCGCCG AGCTGGCCG AG AAG AACG CC CAGGC CAAG C AGCCG 
CTGG ACCTGGCGTC ATTG C AG AAGCCTTGG ACG CTCAT ACCTG AC CTCG AGG ACATGCTACGCGCCTGTAG CAAGG C AAC 
GGGGGTACTGTCGCTGTCGATCCAGGGCTGGCTCTTCGAATCCAGCAAGTGCGACGGCAGGGTCCTGGTCGCCACCTACC 
^ACCGTACCGGCAACAGCACAGCAGCCGACCTGACAGCGGCCAGCCAGCACCTGTTCGCCGACCGCCCCGCCTTCGTCATC 
GACAACGGCAACACCGCGGCCCTGAAGGTCGATCTGAAGGTTGCCATCGGCAGTGATGAGCCGCTACTGCCGGCGGACGA 
CGTTCTGCAGGCGCTGACGAGCCACCTGTACCGTCAAGGGGTCGAGCCCAAGCTGTCGATCAGCCAGGAGACAACTCCGC 
CCCTCCCTGGCGCGGAAGCTGCGACTGAACAGCAAGTGGTGTTGCCTTCCTGGAAGAAATTCACCTTCAGCGCCCAGACC 

CGGCTCCCGGCAGACCTGACCTTCCAGGGGCTGCCCGCTGCCGGTGTCCGCATCACCAACCTCGAAACCACGCTCAAGGA 
CAGCCAGTTGGACTGGACTGTCACAGGAGAAATCTATGCGAACTGA 
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Protein sequence: {SEQ ID NO: 185) 

MEKPD1X5SRGPDVSILSYHGNKFVSGLFWRPLSSQRQYMKEARK1^KEEHIX)IVAIRHSPTVIQAGFVSKSQGAVKGMYS 
I^SALSGQFDGDFLACWKVDEDRYALVATLDGAIVPGQDLVTTLDEARDRVRKLSTRGVLRNAQVFVPEGFDFPVKDFDI 
EELLAPKRLRRDYRLRQLTFGLSAREWTAVALLGCWGGSLTAYY 

LDLASLQKPWTLIPDLEDMLRACSKATGVLSLSIQGWLFESSKCDGRVLVATYHRTGNSTAADLTAASQHLFADRPAFVI 
DNGNTAALKVDLKVAIGSDEPLLPAIDDVLQALTSHLYRC^VEPKLSISQETTPPLPGAEAATEQQVVLPSWKKFTFSAQT 
RLP ADliT FQG L P AAG VRI TNLE TTLKD SQLD WTVTG E I YAN . 

RL085 

DNA~ sequence: (SEQ ID NO: 64) 

ATCGTGTGCGAAGCTACGGCAGATTCCGCGTCTACGATCGCAGCGCAGGTGCGCAACACCCGACCGGATCGGCGCGATAC 
GGTGGTGTTCTCCGACAAACCCTGGGTCAGCACGAAACCCCTAAGCGTTTCGCACACCTTGTCCAGTGACTGCATCGTGA 

cgtggcgccctgcaggcgcagcgtcgctgcaggaggccgcccaggaagtcatcaaccaatgccacatggcggtcagtatc 

acgcccgacgcgctgaacccggccgccttcgccgtgcaacctcagcagcgcgcgagcaacgccccgccgcccatccaagg 

cggccaggacatggccaccatgctgtttcctgcctccgtcgccaacggcatgtcgctcggtgccggcggcagcatggggt 

cgagcttcgggt.cctacggJccgcggtctctgtacaj^catcaaatggaacggcaaagtcagc^ 

gccgcccgagccggcgtgfcctggcgctacaacccaaccgagaaaa 

ccgcatgtacgccttcgacgacgtcaacacggtggactccaccgtgcgttccggtatgacgacggccgccggcatcagcg 

GGGACGGCTCCGGATCCACCGGACAGAATGGCAGCTCCGGCATCAGCGGCGACTCCGGCAGCAAGCAGACCACCAGCTCG 
GAGCTGAAGACATCGATCCTCAGCGACATCGAGAACAGCATCAACTCGATGCTGACGCCGAGCATGGGACGCATGTCGCT 
GTCGCGTGCCACGGGCACCCTGACCGTCACCGACCGTCCAGAAGTCCTCAACCGTGTCCAGCAGTTGGTCAACCGAGAGA 
ACGAGAGCATCACCAAGCAGGTGCTGCTGAACGTCAACGTGCTCTCGGTCGCCCTGACCGACAAGGATCAACTGGGGATC 
GACTGGAACCTGGTCTACAAGTCGCTCAACAACAAGTGGGGCATCGGCCTGAAGAACACCATGCCGGGCATCGATCAAAG 
CGCGATCTCCGGCTCCGTGAGCATCCTGGATACCGCCAACAGCGCCTGGGCAGGATCCAAGGCCATGGTCCAGGCGCTGG 
CCCAGCAGGGCCGCGTCTCGACCGTCCGATCCCCGTCCGTGACCACGCTCAACCTCCAGTGGGCGCCGATCCAGATCGGC 
CGCTACGACAGCTACCTGGCCTCCAGCCAGATCTCCAACGTCGCCCAGGTCGGCAGTACCACCTCGCTGATCCCGGGCGC 
CGTGACCAGCGGCTACAACATGAGCCTGCTGCCGTTCGTGATGGAAAGCGGCGAGATGCTGCTGAAGATCAACATCAACA 
TGACCTCCCGGCCGACGTTCGAAATGCAGACCAGCGGGGACTCCAAAGCCCAGTTCCCGAGCTACGACATACAACTGTTC 
GACCAGAAGGTACGTCTGCGCAGCGGCGAGACCTTGGTACTCTCCGGCTTCGACCAGACCACCGAGGACACCAACAAGGT 
CGG C AC CGG CG ACGCTG GGTTCTT CGGTCTTGGCGG CGG G CTG ACCCG C AATAC CAAGCGCG AGGTCATCGTGGTGCTGA 
TCACCCCCGTCGTGCTGGGCTGA 

Protein sequence: (SEQ ID NO: 186) 

IVCEATADSASTIAAQVRNTRPDRRDTWFSDKPWVSTKPLSVSHTLSSDCIVTWRPAGAASLQEAAQEVINQCHMAVSI 

TPDALNPAAFAVQPQQRASNAPPPIQGGQDMATMLFPASVANGMSLGAGGSMGSSFGSYGPRSLYNIKWNGKVSGFLDLI 

AARAGVSWRYNPTEKRVEFYYLDTRTFRMYAFDDVNTVDSTVRSGMTTAAGISGDGSGSTGQNGSSGISGDSGSKQTTSS 

ELKTSILSDIENSINSMLTPSMGRMSLSRATGTLTWDRPEVLNRVQQLWRENESITKQVLLNVNVLSVALTDKD 

DWNLVYKSLNNKWGIGLKNTMPGIDQSAISGSVSILDTJ^ 

RYDSYIiASSQISNVAQVGSTTSLIPGAVTSGYNMSLLPFVMESGEMLLKININMTSRPTFEMQTSGDSKAQFPSYDIQLF 
DQKVRLRSGETLVLSGFDQTTEDTNKVGTGDAGFFGLGGGLTRNTKREVIWLITPWLG. 
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RL086 

DNA sequence: (SEQ ID NO: 65) 

ATGACCAGGCAGTTGACCACTCTCACGCTGTGCCTGCTGCTCGCCAGCTGCACGACCCACAAGGCTGAGCCGGCCAGGCC 
AGCCTTCGACAGCAGCCGCAATCCAGACCTGCTTTCTCCGGACCTGTATCCAAACGGTGTGCAGCCGGAGAAAGAGCCCG 
TAGTGCGCTATGGGCGCTACACCCTGGTCAGCACCCAGCCTGATGCCGGTCAACGCGACCTGATGGCCCAGATCATCGAC 
GTAACCATCCCGTCGAGCATG7ACCCGAGCGTCAAGGACGCCATGCAGTACGTGATGAGCCGCTCGGGTTACTCGCTGTG 
CCCGGCAGACGCCGGTCATGTGAACATCCTCTACACCCGGCCGCTGCCGGCAGCTCAGTACAAGCTCGGCCCGATGACCC 
TGCGCAACACCCTCCAGGTCCTCTCCGGCCCAGCCTGGCAGGTTAAGGTCGACGAGGTCGCGCGGCAGGTCTGCTTCGTG 
CTGCGCCCGGGCTATCAACTTCCCCCGGCGCCGAGGCCGAAACCGGTCCAGCAACTGTATGCGAAGCCCGCTGCCCCAAC 
TCCGCCGGCGGTAGCGCAACCCTCCTCCACGGAGAAAGTCAGCACGCTGGAGTCGCCCATCGTGGTCGCCTCGGTGCCGA 
CACCGGCGCCGATCACAACCAGCCACGCTCCGGCCAAGAAGCCTGAATCCACCACTGTGCTCCCCCCAGCCGCACCGGCC 
AAGGATGGCCACCCCTCTTCTCCTCCCGCGGCTTCGGCACCGACCAAGCCTGCGGCCTCCGCCGTGAAGTCCACGCCGCC 
CACTCCACCCACCGTGGCTTCCGCCCCACCGGTCAAGGTGCTCACGCCGCCGGAACCGAGCCGGCCGCTGGCACAGGCCT 
GGTCAGCCGAGACGGGATCAACCCTGCGCGACACCTTGGAAGCTTGGGCAAAGCGCGCACGCTGGACCGTCCGCTGGGAG 
CCGCAGGATCTCAACTATCCGATCGAGGCTCCACTGACCTTCCACGGCTCCTTCGAGGACGCGGTATCCGAGCTGTTCCC 
CCTGTATGACGCTGCCGAACGGCCCTTCCTGGTGAACGCCAGCCGCCCGCAGTCCCTGATCATCATCAAGGAGCGCAAGA 
ACTGA .:- - . . - - - - - t — - - - 



Protein sequence: {SEQ ID NO: 187) 

MTRQLTTLTLCLLLASCTTHKAEPARPAFDSSRNPDLLSPDLYPNGVQPEKEPWRYGRYTLVSTQPDAGQRDLMAQIID 
VTIPSSMNPSVKDAMQYVMSRSGYSLCPADAGHWILYTRPLPAAQYKLGPMTLRNTLQVLSGPAW 

LRPGYQLPPAPRPKPVQQLYAKPAAPTPPAVAQPSSTEKVSTLESPIWASVPTPAPITTSHAPAKKPESTTVLPPAAPA 
KDGHPSSPPAASAPTKPAASAVKSTPPTPPTVASAPPVKVLTPPEPSRPLAQAWSAETGSTLRDTLEAWAKRARWTVRWE 
PQDLNYPIEAPLTFHGSFEDAVSELFPLYDAAERPFLWASRPQSLIIIKERKN. 

DNA sequence: (SEQ ID NO: 66) 

TTG AG CTTT AAAT ACTATTGGG CT AAATTTTTCTGGGG AG CTTTCTTCTTTGTTTTAGT CGCTTGG AAAGG CTCCGTATT 
TCCAAGCCTGGCATCAGTTT^ACCCCTTGGTAGTGGCTGGATTCAGTACTATCCTGTTTCCTTTCTCGGTAAGGCTTGTTG 
AAGACTTCGCTTTAAAATATACGGAAAAAGAGTTCTGGGTCACAGGTTTTTTCTCCGAAACCCCTGCAAAAACAGGATTG 
TATGCAGTCTTTTATTTGGCTTGTTATTTGTTTTCAATTCCCTTGGGGATGATTTTTTTATTCTATAAATACGGAAAGGC 
CTCGTAG 

Protein sequence: (SEQ ID NO: 188) 

LSFKYYWAKFFWGAFFFVLVAWKGSVFPSLASVNPLVVAGFSTILFPFSVRLVEDFALKYTEKEFWVTGFFSETPAKTGL 
YAVFYLACYLFS I PLGMI FLFYKYGKAS . 



RL088 

DNA sequence: (SEQ ID NO: 67) 

ATGTCCAATGACAACGAAGTACCTGGTTCCATGGTTATTGTCGCACAAGGTCCAGACGATCAATACGCATACGAGGTTCC 
C CCTATCG AT AG CG CGG CCGTTG CCGGG AAT ATGTTTGG CG ACTTG ATT CAAAG AG ACAT ATAT CTACAG AAAAACATTT 
ATTATCCAGTCCGATCCATTGTTGAACAAGGAACAAAAGAAAAGAAGGAGATCAACAAGAAAGTATCTGATCAAGTCGAT 
GGCTTGCT AAAG CAG ATC ACTCAAGG AAAAAG G G AGG CC A C AAGGC AAG AGCG AGTCG ATGTCATGTCGGCAGTCCTG CA 
CAAGATGGAATCTGATCTTGAAGGATACAAAAAGACCTTTACCAAAGGCCCATTCATTGACTACGAAAAGCAGTCAAGCC 
TCTCCATCTATG AGG CCTGGGTCAAGATCTGGG AG AAG AACTCTTGGG AAG AAAG AAAG AAG TACCCTTTTCAGCAGCTT 
GTTAGAGATGAACTGGAGCGGGCGGTTGCCTACTACAAACAAGATTCACTCTCTGAAGCGGTAAAAGTGCTAAGACAGGA 
GCTCAACAAGCAAAAAGCGCTAAAGGAAAAAGAGGACCTCTCTCAACTGGAGCGGGACTACAAAACCAGAAAGGCGAATC 
TCGAGATGAAAGTACAATCCGAGCTTGATCAAGCGGGAAGTGCTTTGCCTCCATTGGTCAGTCCAACGCCAGAGCAATGG 
CTTGAACGTGCCACAAGACTGGTTACGCAAGCAATTGCTGATAAAAAGCAGCTGCAGACCACAAACAATACTCTTATCAA 
GAATGCCCCAACCCCTCTAGAAAAGCAGAAAGCCATCTACAATGGTGAGCTACTTGTGGATGAGATAGCCAGTCTACAGA 
C C C GCTT AG AT AAGCTG AACGCCG AAACG AC AC G ACG C AGG ACAG AAGC AG AACGC AAGGCGGC C G AGG AAC AAGCGTTG 
CAAGATGCTGTTAAATTTACTGCCGACTTTTATAAGGAAGTAACTGAGAAATTTGGCGCACGAACATCAGAGATGGCGCA 
CCAACTGGCCGAAGGCGCCAGGGGGAAAAATATCAGGAGTTCGGCGGAAGCAATCAATTCGTTTGAAAAACACAAGGATG 
CG TT AAAT AAAAAACTT AG CCTTAAAGAT AGG C AAGC CATTG C CAAAGCCTTTGATTCTCTAG ACAAGC AGATG ATGGCG 
AAG AG CCTTG AG AAATTTAGCAAAGG CTTTGG AGTTG TAGGC AAAG CT ATTG ACGCCG CC AGCCTGT AC CAAG AGTTCAA 
GATATCTACGGAAACCGGGGACTGGAAACCATTCTTTGTAAAAGTTGAAACACTAGCTGCTGGTGCGGCCGCCAGTTGGC 
TTGTGGGTATTGCATTTGCCACGGCAACGGCCACTCCTATAGGCATCCTGGGGTTCGCACTGGTAATGGCAGTTACCGGG 
GCGATGATTGACGAAGGCCTTCTAGAAAAAGCAAACAACCTTGTAATGTCCATTTAA 
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Protein sequence: {SEQ ID NO: 189) 

MSNDNEVPGSMVIVAQGPDDQYAyEVPPIDSAAVAGNMFGDLIQRDIYLQKNIYYPVRSIVEQGTKEKKEINKKVSDQVD 
GLLKQITQGKREATRQER\T>VMSAVLHKMESD 

VRDELERAVAYYKQD SLS EAVKVLRQELNKQKALKEKEDLSQLERDYKTRKANLEMKVQS ELDQAG S ALP PLVS PTP EQW 
LERATRLVTQAI ADKKQI^TTNNTLI KNAPTPLE^ 

QDAVKFTADFYKEWEKFGARTSEMAHQIAEGARGKNIRSSAEAINSFEKHKDAI^KKLSLKDRQAIAKAFDSLDKQMMA 
KSLEKFS KG FG WGKAI DAAS LYQEFKI STETGDWKP FFVKVETLAAGAAASWLVG I AFATATATP I G I LG FALVMAVTG 
AMI DEGLLEKANNLVMS I . 

RL089 

DNA sequence: SEQ ID NO: 68 

ATGAACCGTCCACGCCTGGTTAATCGTACCTCCGCGACACCTTCGACGCTTCTGCAGCGGGCTATCTTCGACGGCTACGA 
CTTCGGCTTGAAGATCCCCTACATCGCAGGCAGCAATCGCGCGCTGCTGGAGCTGTCCGGCTTCTTCATCAGCGCCCGGG 
AGCATCCGTTGCACCGCTACTGGCGGGTCCCCAAAGGCAAGCTGCTGCCTGAACTGGACACTCTGTACAACCGTCTCGCC 
GAGCTAGCTGGAGGCCTTCACTCCCAGTCCTGGCGGGAGTTCAGCTCCTTGGTCGAATCCGCGCAGGCCTCGCTTGACCG 
ACAGGCCTTCACCTGGGGGATGCTGCTGCGCATCGCGCCCCTGGCCGAGGGCGGCGTCCTACTGTCAGGCGAGTTCCATC 

XTGGTGTTGTGGCGGT.GGCTCGGCGGAT.GCGCGGGGTATTCCTG.CGCCCATCGAGTTCCTGGCGCATCGACACCACTCCC 

^AGCTGCTCCGAAGC^CCTGATTCTGGAOT 

GCTCAGCGACGGCAGCTTCGCGCCGTCGACCGAGCTGCCCAGCATGAGCATCGGCGGTCCACA^ 

CATCCCTGGAGGACGAGTCAGCCTCTGACATCTACCTCGCCGCGGTGCCGGAGATCGAGCGCACCGAGTACAGCTCGGCT 
GATATCGAGGCGGCGCTTCAGGGCTACTCTCTACTGGCCCACCAGCCTGACGGCATCGCTCATCTGCTGCAGAGAACCAG 
CGC CTTATTGGC CG ACG ACATGG G ATTGGG C AAG ACC CG C C AGG CGGTCATCGCCGCTTCG ATCCGCGCGG CGGGCAG AC 
CAATCCTGGTCATCACCCTGGCTACCCTGCTGATCAATTGGCAGCGGGAGATCCAGGAGGTCTATCCCTCGGCCACCGTG 
GCCATCCAGCAGGACACCCCAGAGGCGCAGTGGATCCTCGTCAACTACGAGCAGTTGAGCCCCTTCGTCGCCAACGCTTC 
GCGCTTCGCCGTGATGGTCATCGACGAGGCGCAGCGGATGAAGGAACCGACGGCGCAATGCACGCGGCACGGTTTCGACA 
TTGCCGCCCAAGTGCCGAACCGCTACCTGCTTACCGGCACGCCGGTGCTCAACCGCGAGACAGAGCTGCACACCCTGCTG 
CGCCTCTCAGGCCACCCCATCGGCCAACTGCCGCTGAAAGAGTTCTGCGACCGTTTCGCCGGCAACCCGGAGTTCCGCCA 
GAGTCTGCGGGCGGAGCTGGGTGACTGGATGCTGCGCAGGCGCAAAGATGTGCTGCCCAGCCTCAAGGGCAAGCAGCGGC 
AGTTGCTGAAGGTGGCCCTCTCCACCGAGGAACGCCAGCAATACGACGTGCTGCGCCTCGAGGACCGACCGGTCTTCGCG 
CGACTCGGCGCGCTGCGGCGTTACCTGGAAACGGTGAAAGTTCGCGTGGCGATGGACCTGTTGAGCGAGCTCGACGCAGA 
GGACAAGGTGATCCTGTTCTGCGAGTTCAAGCCGACCGTGGCTGCGCTGAAGGAACTCTGCGAGCAGGCCGGACACGGCT 
GCGTCACGCTGGTGGGCAATGACTCGCTCACCAAGCGGCAGAAGGCGATAGATCGCTTCCAGCAGGATCCCGACTGCCGA 
GTGTTCATCTGCACTACGGCGGCCGCAGGGACGGGCAACAACCTCACTGCGGCGAACTACGTGTTTTTCCTCGGCCTGCC 
CTGGACTCCCGGTCAGCAGGAACAAGCCGAAGACCGCGCGTACCGAAACGGCCAGCTCCGCATGGTCGTGGTGAAAATCC 
CACTGGTCGAGGCCACGATCGACGAGCAACTGTGGCAACTGCTCAACGCGAAACGCCAGGTTGCCCAGGACCTCATCGAG 
C CCGAGC AGGTCGACGG AAAC CG CGCGCTTTT AGC CGC AAGCCT AACTGG ATAA 

Protein sequence: (SEQ ID NO: 190) 

MNRPRLVNRTSATPSTLLQRAI FDGYDFGLKI PYI AGSNRALLELSGFFI SAREHPLHRYWRVPKGKLLPELDTLYNRLA 
ELAGGLHSQSWREFSSLVESAQASLDRQAFTWGMLLRIAPLAEGGVLLSGEFHPGWAVARRMRGVFLRPSSSWRIDTTP 
ELLRSNLILELGLAEEQFEILDTVQELLSDGSFAPSTELPSMSIGGPQQEPAAPSLEDESASDIYLAAVPEIERTEYSSA 
DIEAALQGYSLLAHQPDGIAHLLQRTSALLADDMGLGKTRQAVIAASIRAAGR 

AIQQDTPEAQWILVNYEQLSPFVANASRFAVMVIDEAQRMKEPTAQCTRHGFDIAAQVPNRYLLTGTPVLNRETELHTLL 
RLSGHPIGQLPLKEFCDRFAGNPEFRQSLRAELGDWMLRRRKDVLPSLKGKQRQLLKVALSTEERQQYDVLRLEDRPVFA 
RLGALRRYLETVKVRVAMDLLSELDAEDKVILFCEFKPTVAALKEL^ 

VF I CTT AAAGTGNNLTAANYVFFLGLPWT PGQQEQAEDRAYRNGQLRMVWKI PLVEAT I DEQLWQLLNAKRQVAQDLIE 
PEQVDGNRALLAASLTG . 
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RL090 

DNA sequence: (SEQ ID NO: 69) 

GTGGCACCTCTCGACAACGCCCCCCCTAGCGGGCCGCTACAGGATCCATCCCTGGCCCGCTACAGCGAGCGGCAGCTCGC 
CGTCGCCAACACCTGGGCAACACATTTCTCCCTCGCAGGGACAGCTCGAACCAAGTTCATCCGTCACTACCTGCGCAGCA 
CATCTACGACCAGGTGCTGGTGCATCACAGTCGCTGCTGACAACGGTGTGCGTTACACCATCATGCGTGCAGGGCCGCTA 
CTCCAGGTATTCGACGGTCAACTAATTGGTGCGTGGGAGTGCAAGCCTGCCCATCGTATCCCGGCAAGCACGCCGTCTCG 
AGCAGGGGCCTTGAAGCTGCTACAGCGCCTTCAAAAGTTCGACGACGCAGTTGCTGTACTCAGCTCATACACAAAGCGAG 
CGCACGACCTAGCCACACAGATGGCCAGGGACGATCTCGGACTTCAACATCGCCTCGTGTATCCGAGCCACAGCAACAAG 
CGCTACTACGCGCCAAGGCACCAGTTCTACTTGAAGCAGATCGGAGCGGTCTTGCGAACCTTCAGACAGGTCCTGGACCA 
AGACCTGCTGTTCGCCATCCGCTCGGTTCGGTGCCTCTCGCCCCAGCTCTACAACTGGCTGGCTCAAGGCGACCAGGTGC 
G CCGG CTG CAAATG CTG AAGG CTCAG CCGG TCTTG ACGCCGCT ACTGG TGG ATTG CG AGG AGGG AGTCTGG CCTCACACG 
ACGACCAACGACAACGGCGAGAGCATCCGCCATTACCTTCCTTGCCCCTTTCCCCAGCTTGACAGTGAACGACCGCAGGC 
CGCCGCCATGCCATGCGACTTGTACCTCGATATGGGCCGTATTCTTGGGCAGGTCGCGGACGAAGGAATTTCGGTCATCA 
ACTTTTTCGCCTGGCTATTTCAGGCGCCGCGGGCCTCGATTCGATTTCTTAGTCACGTCAGTCCCGGCCGTGCGGGAGGA 
GCTCTCTTCCATCGCAAACGGGAAGGCCGACATTCGGGATGGCATGCTCTCCTACTGGCGGCATCGCTAGGTAACCGGCG 
GCCGATCACTCGCGCTCAATGGACAGCATTCTATGCCGCCTACAATGCGATCCCTTGGCAAGTTCACAACGCCAAGCCCG 
ACTACAACCGTCTCTTCAACGGCTGCCCGTCGGATTGGCAGGATCCGGCATGGCTTGCAATCACTGCACGGCTGAGAGAC 
ATCAAGGAGTTCTATACCGCCCTCGACCAGGGGAACTCACAGGTTGTTCGGCAGGCGCGCAGCGCCCTGAAAGCGTATCT 
GGGTCATTGTACCTACCGACAAGCTGGCAACCTGGTGGACGACTACCACCAGGTCCAGAGGGAGCTGCGTGCCGCAGTGC 
AG AG C AGC CTGCC CG ATCTGGT CG ACAC CG ACG AG TAC ACC ACCTGGG AGGG AATG CTGTCTGTCGGTCTT ATCG ATTGC 
CCTAATGGACTGCAGATCGTCGAGCTCCGCTGTCCTGCCGACCTATATGCCGAACATATCGCTCTGGCACATTGCATCGA 
TAGCTACGACCAGGCCGCCTACCGAGGAGACTGCCGACTGCTCTCAGTACGTGAGGCTGGTCGTCCGCTGGCCTCTGCCG 
AATTGGAGCTCAGGCGTGAGCATGGCGAGCCTATAGGTAGGCCCTGGAGTCCCAAGCACCTTTCCACGGTGCAACTGCGC 
G AATTCG ATAATG CCCCCG TG C CG AC CG ACT CG CCTG CCGG C CAGG CATACCGCTGGTTCATGG AACGAATTCG CT CTGG 
AGCCATAGCGACGAACCTGAACTGGCCCGACATGACCGTCCACATGACGCGCTTCGCCAATGGTCGCTGGAAGGCGGGCC 
TCGCCGAAGCCACGGCGAAGTGGCTGCTCACTCAGTTGGAAGACCGATGA 

Protein sequence: (SEQ ID NO: 191) 

V AP LDN AP P SG PLQD P S LARY S E RQLAVANT VJ ATHF S LAGT ARTKF I RH YLR ST STTRCWC I TVAADNGVRYTI MRAGPL 
LQVFDGQLIGAWECKPAHRIPASTPSRAGALKLLQRLQKFDDAVAVLSSYTKRAHDLATQMARDDU3LQHRLVYPSHSNK 
RYYAPRHQFYLKQIGAVLRTFRQVLDQDLLFAIRSVRCLSPQLYNWIAC^DQWRLQMLKAQPVLTPLLVDCEEGWPHT 
TTNDNGESIRHYLPCPFPQLDSERPQAAAMPCDLYLDMGRILGQVADEGISVINFFAWLFQAPRASIRFLSHVSPGRAGG 
ALFHRKREGRHSGWHALLLAASLGNRRPITRAQWTAFYAAYNAIPWQVHNAKPDYNRLFNGCPSDWQDPAWLAITARLRD 
IKEFYTALDQGNSQWRQARSALKAYLGHCTYRQAGNLVDDYHQVQRELRAAVQSSLPDLVDTDEYTTWEGMLSVGLIDC 
PNGLQI VELRCPADLYAEH I ALAHCI DS YDQAAYRGDCRLLS VREAGRPLASAELELRREHGEPI GRPWS PKHLSTVQLR 
EFDNAPVPTDS PAGQAYRWFMERIRSGAI ATNLNWPDMTVHMTRFANGRWKAGLAEATAKWLLTQLEDR . 



RLiOSl 

DNA sequence: (SEQ ID NO: 70) 

ATGCGAAAAGAGAATATATCTGCCGAAATCACAGAGCGAGCTTTTGATTTTTTCTATTGGTTCTCGCGATTTGAGTTCAG 
CCTCAAAGAGAATGGCTACTTAAAAAATTACAAACCTGGAGCTAGGGCAGAGCCGGGATGGGAAAATTTTGTACAAAACC 
ATTCTGACAAATACTCTCTTTCCCAATCAGCCACAGCACTAATCGAGCAGAGTCCAGAGCAACAAATAGTCCTGCCCGGT 
AGAGAGCTGGGTTGGCGTCCGGTTAAATTAGATGAGGACAAAAGCGACTTAGCTAGAGTCGCTCGCTTACTTAAGACCGT 
GCGAAACAATCTATTTCACGGAGGCAAGCATGGTGGTGCCAACTGGGACAACCCAGCGAGGACAATACATCTTATTCTTT 
T AAGTAAAG CT ATCCTTG ACG AGTTTG CTG C ACTAGG AG ACT TTG AGG CTG ACT ACAAG AG AATTT ACTGA 

Protein sequence: (SEQ ID NO: 192) 

mrkenisaeiterafdffywfsrfefslkengylknykpgaraepgwenfvqnhsdkyslsqsatalieqspeqqivlpg 
reuswrpvkldedksdiarvarllktvrnnlfhggkhggaw^ 
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RL092 

DNA sequence: (SEQ ID NO: 71) 

ATGCACATCGTAATCATTGAAGCCCCGGGCAAGCTGAAAAAGCTGAGGTCCCTTCTCCCCTCGATTGGTCCCGACGTGAC 
CTGGCAGGTCGAGGCGACAGCCGGCCACATCAGAGACCTACCCGTTCACGGGCAGGATCCGCAGATGCTCACCGTCGGCG 
TGGGCCAGGATTTCAAACCGCACTACCAGATCCTCTCGGGCAAGGAAAAAACCGTCGCACGGCTGAAGGAGCTGCGGCAG 
AAAGCCGTGGAAATCTACGTCGCATCGGACCCGGATCGCGAAGGCGAAAGCATTGGCTGGCACATCCTCCAAGCTGCCGG 
GATCAAGAACTACAAGCGCGTTGCCTTCAAAGAAATCACAAAGTCATGCATCACCGCCGAACTCAGCTCGCCGCGTCGCC 
TGGACCTCCCGAAGGTCGCCTCGCAGGAATGCCGTCGCGTCATCGATCGCCTGGTGGGGTATCTGGTCACGCCAGAGTTG 
CGGCGCGTGATGGGTAGGCCGACCACCGCCGGGCGCGTGCAGTCCGTCGCGGTGTACCTGGTGGTCCTGCGAGAGCGGGA 
GATCCGCGCCTTCACAGCAATCAAGCACTTCGGGGTGGAACTGACCTTCGTTTCGCCCAGCGACGGCCGTACCTGGACGG 
CGGAATGGGATCCAGTGCCCGTGTTTGCCAGCGAGGAGTTCCCGTATGTCCAGGATCGTCAACTCGCAGAACTGGTGGGG 
GCTATACGTAaTGTCATCGTCGAGACCTGCAT^ 

GCTCCAGATGGCCGCCGGGAATGCGCTGAAGTGGTCACCCGACAAGACGATGAAGGTCGCCCAGCGGCTGTATGAACAGG 
GGCTCATCACCTACCACCGGACGGACAACCCCAATATCTCGAAGGACTCGATGCCGGATATCCGTGCTGTCGCCAAAGCC 
TTGGGGCTGAAGTGTGTTGAGCAACAGCGGATGTTCAAAGCGGACCAAGACGCCCAGGAAGGCCACCCCGCCATCACCCC 
TACCGACTGGATGGCCGCTGCCGCCGGTGAAACTGCTGATGAGCAGGCGCTGTACCAGCTCATTCGAGTCCGCGCGCTTG 
CCAGTCAGATCGAAGCTGCCGTGTACGCAGTGAGAACCATCACCCTCCTGGGCGTCGGCCCCGACAAAAAGCCGCTGCGC 
T-T-CGGGGCCAAAGGGAAGCTGTTGAACGTGCCTGGGTGGAGAAAAGTGCTGGAGGGTGATGAGGCCGAGGAGCAGAAGAA 
CGAAACGCCTTCAAACCCCATCCCGATCCCGGCGCTGGAGCCACGCCAGATACTCAAGGTCTACAGCGGCGAGGTCCTGG 
AGAAGAAAACCACCCCTCCCAAGCGATTCACCGACGCCAGCCTGGTGGGCGAGATGAAGCGCCGCGGGATTGGTCGGCCA 
TCCTCCTACGCCTCGATCGTGAAGAACATCATCGACAAGGGCCAGGTGCAGATGAAGGGGCGAAGCCTGATCCCCGGCGA 
GCTGGGAGAGGCCACCATCGCGCTCCTGGAGCACAACTTCAGCTTCCTCAGCCTCGACTTCACCCGCAACCTCGAGGTCG 
CCTTGGACCGGATCGCCAACAGCGAGGACACCTACATGAACGTGGTCCAGCAGTTCTACCAGCTACTACAGACAGAGCTG 
CAGACACTCCGCGCGCTCCCCAGCGCACAGGACGAACCACGCGCAAGCTCCACCGCCAGTATCTCCTCGGCGCCGACCAG 
CGACTTCCTTTGCGGCAAGTGCGGTCTGCCCCTGGTTCACCGCAAGAAAGCCGGCAAAGGCGGCTTCGACTTCTGGGGTT 
GCAGCGGCTATCGAACAACAGGGTGCAAGGTTAGCTACCCCACCAAGAGCGGCCGGCCTGACTTCGACAACCCGCGCGGG 
CTATAG 

Protein sequence: (SEQ ID NO: 193) 

MHIVIIEAPGKLKKLRSLLPSIRPDVTWQVEATAGHIRDLPVHGQDPQMLTVGVGQDFKPHYQILSGKEKTVARLKELRQ 
KAVEIYVASDPDREGESIGWHILQAAGIKNYKRVAFKEITKSCITAELSSPRRLDLPKVASQECRRVIDRLVGYLVTPEL 
RRVMGRPTTAGRVQSVAVYLWLREREIRAFTAIKHFGVELTFVSPSDGRTWTAEWDPVPVFASEEFPYVQDRQLAELVG 
AI RNVI VETC I DS EETDAPPAP FI S SS LQMAAGNALKWS PDKTMKVAQRLYEQGLITYHRTDNPNI SKDSMPD IRAVAKA 
LGLKCVEQQRMFKADQDAQEGHPAITPTDWMAAAAGETADEQALYQLIRVRALASQI^ 

FGAKGKLLNVPGWRKLLQGDDAEEQKNETPSNPI PI PALEPRQI LKVYSGEVLEKKTTPPKRFTDASLVGEMKRRGIGRP 
SSYASIV^IIDKGQVQMKGRSLIPGELGEATIALLEHNFSFLSLDFTRNLEVALDRIANSEDTYMNVVQQFYQLLQTEL 
QTLRALPSAQDEPRASSTASISSAPTSDFLCGKCGLPLVHRKKAGKGGFDFWGCSGYRTTGCKVSYPTKSGRPDFDNPRG 
L. 

RL<)93 

DNA sequence : SEQ ID NO: 72 

ATGGATCAAAGCCTTTGCACATGCATGCCAACGCCAATCGTCAACCCCAAGGAGCTGCGACTGTGCCACATGTTAGTCGG 
TAGAACTTTCCCGATAACATTGATCGCAGGCGACCATTGGTTGAGCTATGACGGCAGCGCCTGGTGGGTCGATGCGGATG 
AGCCCGCGACGGAGGACGAGGTGGCGGCTCTGTTGGTCAAGGCTGGTGGTGTCACTACGTGCTGGTGCGGATAG 

Protein sequence: (SEQ ID NO: 194) 

MDQSLCTCMPTPIWPKELRLCHMLVGRTFPITLIAGDHWLSYDGSAWWVDADEPATEDEVAALLVKAGGVTTCWCG. 
iRL0?.4 

DNA sequence: SEQ ID NO: 73 

GTGGCAAGGGCTTCCGAATCGGAAATCTCGACCAGTACGAGGTGCAGTGTGTCAAAGAGAGCGACCGATACCGACAAGCT 
G G ACAG ACG A C A CTTC AACG ATCCCCACCGG ACTGT ACGGG CT ATTGGTGCTG AGG CCGCG CGG AAAGGG CT ACGGGTGT 
TCGACTGCCCCTACAGTCATCCTGCGATGCGGGCGTCCTGGTTGAAAGGGTTTGCCCAGGAGCAGCAACAGCAGCTCGAC 
TTCTGA 

Protein sequence: (SEQ ID NO: 195) 

VARAS ES E I STSTRCS VS KRATDTDKLDRRHFND PHRTVRAI G AEAARKGLRVFDCPYSHPAMRAS WLKGFAQEQQQQLD 
F 
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RLQ95 

DNA sequence: {SEQ ID NO: 74) 

ATGGCTACCCCCGTCTTCTGGGAAGCCAACATTGGCTCC-GCGCCGGAGCACCGCAGCTTCCCCAACGGCAACAATCCCCC 
GCGGCAGTTGCTGCGACTGAACGTGATGTTCGACAACTCGATTCCCGATGGCCAAGGTGGCTACAAGGATCGCGGCGGCT 
TCTGGTGCAGCGTCGAATGGTGGCATCAGGATGCCCAGCGCTTCGCCGAACTGTTCACGAAAGGTATGCGCGTCAAGGTC 
GAAGGCAGGGCCATTATGGACCGCTGGCCGGACAAAGAGTCAGGCGAAGAAGTCCAGGCGCTGAAGGTCGAAGCCTCGCG 
CATTTCCATCCTTCCGCATCGCCTGGCCGAGGTCACCCTGTTGCCAACCCAGCATCAACAGTCTCGGAACGTCCCGCAGC 
AACCTGCTCAGCAAGATGCGCAATCGCAGCAGGACTACGACAGCGCCTTCGACGACGACATCCCCATGTA 

Protein sequence: (SEQ ID NO: 196) 

I^TPVFWEANIGSAPEHRSFPNGNNPPRQLLRLhT^FDNSIPDGQGGYKDRGGFWCSVEV^QDAQRFAELFTKGMRV^ 
EGRAIMDRWPDKESGEEVQALKVEASRISILPHRLAEVTLLPTQHQQSRNVPQQPAQQDAQSQQDYDSAFDDDIPM 

RL096 

DNA sequence (SEQ ID NO: 75) 

ATGCGGCAGCTCGAT7VAGGACCAGCAAGGCGCTCTGGAACAAAGTGCCTTCCGCCCACTGCAACAAACTGCCTTCCAGGC 
GCTGCAACACAGTGCCTCACTAAAAGGCCTTTTAAAGCCTTTTAAAGGTAATAGGGAGCTGGCCCAGTTGGCGGAACAGT 
--GCGAAGGCATGGAGCAGGGATTGGTTGAAeTTGGCCAGGGACTGCTGGCGCAGGTTGGTGGGCCACCCTTCACTCTACTG 
CCCACCCGACTCATCGAGCAGCGCACATCCGCCCGCACAACCTTTCTCCGCTGGCAGCACATTGCATCCCGGCGGATGGG 
CGTCGGGGTGTGGACGGAAATGCTGCGCCAGGACAAGACCCCGGAATACCTGCTGCAAGACCTCTACGAGATGGAGCTGC 
AGCGCATCACCCTCAACATGCAGATCAGCCTGATCCACTCCATCGGCAAGCAGGCCGCCGAGTGCGCGGAAAAGATGGGC 
CAGGCCGAGGCCGAGTTCATGGGCCGACTGCAGCAGAGCACCAACCACCACTGA 

Protein sequence: (SEQ ID NO: 197) 

MRQLDKDQQGALEQSAFRPLQQTAFQAI^HSASLKGLLKPFKGNRELAQLAEQCEAMEQGL^ 

PTRLIEQRTSARTTFLRWQHIASRRMGVGVWTEMLRQDKTPEYLLQDLYEMELQRITLNMQISLIHSIGKQAAECAEKMG 
QAEAEFMGRLQQSTNHH 

RL097 

DNA sequence: (SEQ ID NO: 76) 

ATGGCTGAAACCCATCGGCTTCAGATCGGCTCTCTCCGCAGCGATGTCGCTCTGACGCTTCACACCTATCACGCCGCCCG 
CATCTGGACCGGCCGGCAGAAGTCGGATGCCAAGCACAGCATCCTCGGCCTCTCCGGCTTCTGCGCATACGTGAATCGCA 
TGCACCGCGGGGCAGCACAGGACGATCCGTACTCCGACTGGTGGCTGGTTCAGATCGAAGAGAAAGTCGAGAGCTGCCAA 
GCCGCACTCGAGGCCATCGACCAGCGACTGGATGACGTCATGGCCAAGCTGCCCGCGACCCTCGATATCTCCGAGAACCT 
GTCCGTTACACCGGTCAAGGTCCCGTTGTTCATCTCCAACCCTCTCGGCTTCAAGGCAGTCTATCTCCTGACCAACTACG 
ACGAACTCGCCCGTCGAATCCTGCTGGCCCAGCACGTCGGGCTGGTCGGTCGCCGCGACATGGAGGTCTGGCTCGACGAA 
GGTGCGTCGGTGCTGCGAAGCCTGTTTGGTCTGGCCCAGAGCTACCAGTTCTCGGGCGCCACTCGCGACGACTTCGCCGC 
AAACAATGCTCGCGCCGAAGCCGCGCGGAAGATGTACGAGAAGTTCGGCGAGATCCCGCAGGACATCCTGGAGGGCACTC 
GACGCTCGAACTTCGCTCCGCCGATCACCCGGGGCCGCTCTGACGGTGATGCCGATGATGACGCTGACCGTGTCGAACTC 
GAGGACTGA 

Protein sequence: (SEQ ID NO: 198) 

MAETHRLQI GSLRSDVALTLHTYHAARI WTGRQKSDAKHS I LGLSGFCAYVNRMHRGAAQDDPYSDWWLVQI EEKVESCQ 
AALEAI DQRLDDVMAKLPATLDI SENLSVTPVKVPLFI SNPLGFKAVYLLTNYDELARRI LLAQHVGLVGRRDMEVWLDE 
GASVLRSLFGLAQSYQFSGATRDDFAANNARAEAARKMYEKFGEIPQDILEGTRRSNFAPPITRGRSDGDADDDADRVEL 
ED 

RL09.8 

DNA sequence: (SEQ ID NO: 77) 

ATGTTCTTGAGCATGGCCCCTTTCTTTTTGGTCGTTCTCGTTCTTTCTGCACTTTTTACAGATGCGTGGAACGACCGAGA 
ACTCAGGCTGTTGTTAATGCTGATCGTGTTCGGGTATTCAGTAACCGTGTTGACCATTACGGTTGAGATGTATCGCTTTG 
AAATGGCGGAAAAAGCGATGTGGGGAGCTTTATGCAACAAAGCCAACTACATGAACTGCCAACCAGATTACCAACGGTAG 

Protein sequence: (SEQ ID NO: 199) 

MFLS^PFFLWLVLSALFTDAWNDRELRLLLMLIVFGYSVTVLTITVEMYRFEMAEKAMWGALCNKANYMNCQP 
RL099 

DNA sequence: (SEQ ID NO: 78) 

ATGAGAAAGTCTCGGTCGGGCGTCGTGTTTTTTGGTGATGCGGCCCGCATCACTCTCCCAGGTCCTGACCTCCGCGCCGC 
CGGCGAGCTGGGTGATTCCACTGGAATCACTCCACCAGGAGCCGACCTCCGCGCCGCCGGCGAGCTGGGTGATTCCACTG 
GAATCACTCTGCCAGGGATCCACTTCGGTATCGGCGGCAAGATGGGTGTTTCGGGCCGAAACACTTCGCCAAAGCGAGGC 
ATCACCACTCACGAGGAACTCAAACAATGTTCTTGA 
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Protein sequence: (SEQ ID NO: 200) 

MRKSRSGWFFGDAARITLPGPDLRAAGELGDSTGITPPGADLRAAGE1X3DSTGITLPGIHFGIGGKMGVSGRNTSPKRG 

ITTHEELKQCS . 

FIGURE 

RL1O0 

DNA sequence (SEQ ID NO: 79) 

ATG AGGCTGT CG CG CTTT C CC ATTTCG ACACTGCTGG ACTCGGCCTCGGGGC ATCTCG AGGCCCATTTGTAT AAG AAGCG 
GCTTGCTGCCGAAAGCGGCGAACCGCTGGCTCAACAATATTCCGGCATCATTTTCAGCGGCAATCCTCATGAAACCGTTC 
CACGGCGCCTCCTCCTGGATAAGCGTCTTACTCCGCTGGAGCGGAACTGCTGGCAAGTTTTCCGCTTGCTCATCAACGAC 
GACGGACTGACCGCGTTCCCGACATACGAGCAACTGCGCCCCTATCTCGGTATGCAGCCGGGCAAGATCGCCTCGCGCGA 
AACCATCGCCAAGGCACTCACGGTCCTTCGTCTGACCCGCTGGCTCAGCCTCGGCCGACGCCTGCGCAACGACCTCAACG 
GACAGGTCCAGGGCAACGTTTACATCCTTCACGACGAGCCTGTCTCTCCAGCCGAAGCCTTGGAGCTGGACACCGACTAC 
ATGCAGTTGCTGAGCCAATCCACCGGTCACGGCAACCGAGCCATACGCGAAATCGGGCAGATCATCTGGCGGGAGTTCAG 
GGATGATCCGGACGTGGGTCGCCGCCTCCCTACCCATCTGGAGAAGCTCGAGGGACGCTTGAACCACCAGCAATGGGCTA 
TCGATAGTCAGCTCGAAGCGGATCCAGCGGCAGAGTTCGGCATCCGAACTCTGTCGGATTTACCTCATTCCACCCCGAGT 
TCGGATGCCGAACTCAGTGAAATCAGCGGCAAGCAATGCGCTCTACCGCTGAGTTCGGATACCGAACCCCGACAGAATCC 
GCGGAGTACGCCCTTGGTTCGGATGCCGAACTCATATAGTACGTATACATACAAACAAGATTG-TGTATGTAAAAAGCCAG 
"TACAACCGCGAGCACGCGAGGAAGGCCATCCGAACTGGCAGGATCTCCTGGACGGACTGGAGGCCGAGGAACGGATCCAG 
GCAGTAAGCGCGCTCAGACGGGTGTCCGAGGATCTTCGGCTACCCATCATGGAGCAGTGGCAGCACCGTTGTGCCGGCGG 
AACAGTCAGCAATCCGTTCGGCTACCTCATGACGCTCATCCAGCGTGCAGTCCAGGGCAAGTTCAACGCTTCTTGGGCTC 
CGGAAGAACCGGCTGAGCGAACCATCCCCGCAACGGAACGCCCCATTCGTGCTCCGGCACCATCAAGCCCCATAGCGCCT 
ACACAGCCTCAGGTCCAGCCCCGGGGGGATACCCGGACAGGGAGCGAGGTCCTCAGCCGGCTCAAGGACCTCATTCGGCC 
CAGGCACGGATCGAGCGTGCCATCCGAGCGGGGTGATGATTCATGA 

Protein sequence: (SEQ ID NO: 201) 

MRLSRFPISTLLDSASGHLEAHLYKKRLAAESGEPLAQQYSGIIFSGNPHETVPRRLLLDKRLTPLERNCWQVFRLLIND 
DG LTAF PT YE QLR P YLG MQPGKIASRETI AKALTVLR LTRWLS LGRRLRNDLNGQ VQGNVY I LHDE P VS P AEALELDTD Y 
MQLLSQSTGHGNRAIREIGQIIWREFRDDPDVGRRLPTHLEKLEGRLNHQQWAIDSQLEADPAAEFGIRTLSDLPHSTPS 
SDAELSEISGKQCALPLSSDTEPRQNPPSTPLVRMPNSYSTYTYKQDSVCKKPVQPRAREEAHPNWQDLLHALEAEQRIQ 
AVSALRRVSEDLRLPIIEQWQHRCAGGTVSNPFGYLMTLIQRAVQGKFNASWAPEEPAERTIPATERPIRAPAPSSPIAP 
TQPQVQPRGDTRTGSEVLSRLKDLIRPRHGSSVPSERGDDS. 



R||ioi 

DNA~ sequence: (SEQ ID NO: 80) 

ATGTCGAAGTCGACGATCAATGAAGCGGTCCTGACGCAGGTGCTCAACCACCTGCGCAACGGCCAGCTCAGGCGTTGTGC 
CGAGATGGGGCTGCGGCCGGAGATTCTGGCTCAGCTCCAACAGCCTGCCGTCATGAGCATCCTGACCAATACCCCGGTTT 
CCTGGGTAGATGTCAGAGTGAACATCGACGTCATGGAGAAAATCCTGGCCACAGCCGAGCGCAGCGCGCAGGAAGACCTG 
CAGATCGAACGCGCACTGAAGCTGGGAGCCACCACAACGATGATCCAGAGCTTTTTCGGTCTGTCGCCGGAGGACACCGC 
CACCAAGCGCTTGATGCTGGAGATCCACCCGCGCCGCGGTCGCTGGCGGCAGCTCGATGAACAGATCGAGCGCCAGATAT 
GGTTCCGCTGGGAGCACCTGATGCAGGAAAATCAGGTCCGCCTTGAAGACAGCATGGAGTTGCTGGACATCGCGATGATC 
CTCACAGAGGAAATCAACGCCGGAATCGAACAAGACAGTCCAGAATTCATCAGCCTCGCCATTGTTTGGTCTCTCATCCA 
GAGCTGGTTGAAAGACGGGCTCTATCCGTCTGGCAAATCGAGCCAGAGCCAGGCGGGCCTGCAAAAGTCCCAATCCACTC 
TTT ACCTCG CTAG CGTC AG CTC A C ACCTGCCCC ACTCTGCC CC ATC CG C AAC AACG C AGGTG AACGCTG AG ACAG AACGT 
CAACAACTACTGAACCTGGTTCAGTCGGAAGGCGACACAGCACCATGA 

Protein sequence: (SEQ ID NO: 202) 

MSKSTINEAVLTQVLNHLRNGQLRRCAEMGLRPEILAQLQQPAVMSILTNTPVSWVD\mVNIDVMEKILATAERSAQEDL 
QI ERALKLGATTTMI QS FFGLS PEDTATKRLMLE I HPRRGRWRQLDEQI E RQ I W FRWEHLMQENQ VRLE DS ME LLD I AM I 
LTEEINAGIEQDSPEFISLAIVWSLIQSWLKDGLYPSGKSSQSQAGLQKSQSTLYLASVSSHLPHSAPSATTQVNAETER 
QQLLNLVQSEGDTAP . 
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RL102 

DNA sequence: (SEQ 

ATGAGTATGGCCAAGATCAACCCGCAAGATCTGAAAGACCGGCTACTTGCCCCTGGTTTTACCGCACCGCCCAAGGTTCT 
GGAGCAGTTGTCGGACCCGATCAGCGACACGCCCATGAGGCTGACACTCCACGACGTCCTCCCCTGGCACGATAACCCCC 
GGACCACTCGAAACCCGAAATACGATGAGCTGAAAGAATCGATCCGACATCGAGGCCTCGATACGCCGCCACCAGTGACT 
CGACGCCCTGGAGAGGACAAATACCGCATTCGCAACGGCGGAAACACTCGCCTGGAAATTCTCAACGACCTCTACAAAGA 
GACCGGAGACGAGCGCTATTTCAGCTTCGACTGCCTGTTCAAGCCCTGGGACAAGCAGCGCGGCGAAATCATCGCGCTGA 
CCGGTCATTTGGCCGAGAACGATCTGAAGGGCGACCTCAAGTTCATCGAGCGCGCGGTTGGGGTGCAGAAGGCGAAATTT 
CTTTACGAACAAGAGAACGGCGGTGAAAGCATTTCCCAGCGCGAGTTGGCACGTCGGCTAAAAGCGGACGGCTACCCTGT 
ATCTCAATCCCATATCAGTAAGATGTTAGACACTATTGAGGTATTGGCGCCGGCGATTCCTGTGATGCTGTATTCAGGGC 
TCGGTAAACCGCAAATCGAGAAACTCCTGTCACTCAGAAAGTCGGCATCCTCCTGCTGGGCACGTCTATACGCTGGTGAA 
GGGGTTGACTTCGAAATGCTGTTCCAGGACACCCTGGCAATCTTCGACAGTAGCCCTGACGAATTCATTTTCGAGCGTTT 
CCAGGACGAACTCATCGACCAAATGAAGCGCCCCCTGGGCCTGCGTTATGACCAAATCCTGCTCGAGATTACCAACGGGC 
AG CAGG AGC AACG CCG CGG C ACTCTGGTCG ACCTG CCC ACACCTGC CG C ACCACCTC AACTCCCACC AATTGGGC AGGAA 
AACCCTGCTGCGTCGTCTACTGGACAAGCACAAACACAGAGCCCCGCCGCAGATCCCCAAACGTCCAGGACAAGGAGCAA 
CCCGGGTAATCCCCTCCCCCCGCCGGCTCCGCCACCACCTGTCCAACAAAAGCAATTGCCCGATGAGGAGCGTGCGGCGG 
TCTTGGCAGGCCATATCGTGAGCCCGGTATCGACTAAGATCCAGCAGACTCGCCAACGGCTGGCCGGCCTCGAGGGGGAA 
CATCTACCTGTCTTCGATGAAACAGCTCTGCAGGCAATCCGAGTGCAAGTCGGTGGCCTGCACCCGATeAGCGATCTGTG 
~ GTAGATCG AG GGGTCG ATCG AT ACCCCCG AG ATCCTG CGAG AG GACATCGCTG ATCTGG GTG AAG AG ATGGGTCTGC ATG 
TCGGCGCCCCAGGCGAGATCGTCAGGATTCAGGGCGGTGTGGGTTACACGTATCGCGAGCCCAATGAAGACCATGAGATT 
ACTG ATTC AG CG CTG CACCT C ATG ACGCTGCTTC AAG CGGT C AG CG GG CAGGTCCAAGTCGTTCTG AAC ACTC ACG ATCA 
ACAGACCTGCCGCGATGCACTGGGTGAATTCCAGTTCTCAGCTGGCCTCGCTCAGTTGCTGCTGGGCCAACCCACCACAA 
GTGACAAGCCATCCTGCCAGGCAGGCCGTCTCAATGACGAAGCCCTGGTGAAACTGTTCCGGATCATTCGTCTTGCCCGA 
CGCCTGGTTGACCTTGAGCTGCCGCCGGCCGCCTCCGAGCAAGCAGCTACTGACCAGTGA 

Protein sequence: (SEQ ID NO: 203) 
MS^KINPQDLKDRLIAPGFTAPPKVLEQLSDPISDTPMRL^^ 

RRPGEDKYRIRNGGNTRLEIL^LYKETGDERYFSFDCLFKPWDKQRGEIIALTGHLAENDLKGDLKFIERAVGVQKAKF 
LYEQENGGESISQRELARRLKADGYPVSQSHISKMLDTIEVLAPAIPVMLYSGLGKPQIEKLLSLRKSASSCWARLYAGE 
GVDFEMLFQDTLAIFDSSPDEFIFERFQDELIDQMKRPLGLRYDQILLEITNGQQEQRRGTLVDLPTPAAPPQLPPIGQE 
NPAASSTGQAQTQSPAADPQTSRTRSNPGNPLPPPAPPPPVQQKQLPDEERAAVLAGHIVSPVSTKIQQTRQRLAGLEGE 
HLPVFDETALQAIPVQVGGLHPITDLWYIERSIDTPEILRQHIADLAEEIALHVGAPGEIVRIQGGVGYTYREPNEDHEI 
TDSALHLMTLLQAVSGQVQWI^THDQQTCRDAUSEFQFSAGIAQLLLGQPTTSDKPSCQAGRLNDEALVKLFRIIRLAR 
RLVDLELPPAASEQAATDQ . 



DNA sequence: (SEQ ID NO: 82) 

ATGACCATGGCCCGAGAAACCGAAGATAAGTTCGTTGTCCGTATGCCCTTGGGCTTGCGCGATCAGCTAAAGCAAAAAGC 
CGCGGATAACCACCGTTCGGCCAACAGCGAGATCGTCTACCGACTGGAGCGCAGCAACGCGCTCGAAGAAGAACTCGCGC 
GAGCAAACCGAATGGTCGACGAACTCTTCGCCAAGAACCAGCGCCTGCAGGCTGAGCTGGCGGCGGCGAACACGCCTCAG 
GTGGCGGAGGCATGA 

Protein sequence: (SEQ ID NO: 204) 

MTMARETEDKFWRMP1X3LRDQLKQKAADNHRSANSEIWRLERSNALEEE 
VAEA 

RL104 

DNA sequence: (SEQ ID NO: 83) 

ATGCCTATCAAACACGCCATCGTCCACCTGATCGAGAAGAAGCCCGACGGCACCCCGGCCGTGCTCCACGCGCGCGACGC 
CGAGCTGGGCGACTCCCAGGCCATCGAGTiACCTGCTGGCCGACCTCAACGAAAGCTACAACGCCAAGAACAAGGCCTGGG 
GCTTCTTCCAGGGCGAGTCCGGGGCCTACCCGTTCAGCGGCTGGCTCGGCGAGTACCTGGAGGGCGACCGCGACTTCGTC 
GGCTTCAGCCGCGAAGCGGTCGAGCACCTGCAAAAGCTGATGGAGGAGTCCAATCTCTTCACCGGCGGCCACGTCCTGTT 
CGCCC ACT AC C AGC AAGG C ATG ACCG ACTACCTGGCG ATCG CC CTGCTG C AC CACAGCG AAGGCGTGGCGGTG AACGAGT 
CGCTGGAGGTCACCCCGTCGCGCCACCTGGACCTCGGCCAGTTGCACCTGGCCGCGCGGATCAACATTTCCGAATGGCGC 
AAC AA C AAGCAG T CG AAG C AG T AC ATCT CGTTCATC AAG GG C AAGGG CGGG AGGAAGGTCTCCGACT ATTTCCGCG ACTT 
CATCGGCTGCCAGGAAGGGGTGGATTCGCCGAGCGAGACGCGCACCCTGCTGAAAGCCTTCAGCGATTTCGTGGAAAGCG 
AGGACATGGCCGAGGAACAGGCCCGCGAGAAGACCGAGACGCTGGTCGACTACGCCACCTCGCAGGCGCGCATCGGCGAG 
CCGATGACCCTCGACGCGCTTTCGGAACTGATGGACGACCAGCAACCGCGGGCGTTCTACGACTACATCCGTAACAAGGA 
XTACGGCCTGTCGCCG.GAAATCCCGGCGGACAAGCGCACCCTCAACCAGTTCCGCCGCTTCACCGGCCGCGCCGAAGGCC 
""TGTCGATCXGCTTCGAGGCGCACCTGCTGGGCTCCAGGATCGAGTACGACGAGGAGCGCGACACGCTGCAGATCAGCAGC 
CTCCCCACTCAACTCCGCGACCAGCTCAAGCGGCGCAAGGCCCAAATTGGAGAATGA 



ID NO: 81) 
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Protein sequence: (SEQ ID NO: 205) 

MPIKHAI VHLI EKKPDGTPAVLHARDAELGDSQAI ENLLADLNES YNAKNKAWGFFQGESGAYPFSGWLGEYLEGDRDFV 
GFSREAVEHLQKLMEESNLFTGGHVLFAHYQQGMTDY^ 

NNKQS KQYI SF I KGKGGRKVSDYFRDFI GCQEGVDS PSETRTLLKAFSDFVESEDMAEEQAREKTETLVDYATSQARI GE 

PMTUDALSELMDDQQPRAFYDYIRNKDYGLSPEIPADKRTLNQFRRFTGRAEGLSISFEAHLLGSRIEYDEERDTLQISS 
LPTQLRDQLKRRKAQIGE . 

RL105 

DNA sequence: (SEQ ID NO: 84) 

ATGCGTAGTTTCCTTCGCGGCGCCCGGGAAAGCGTTCGCCGGCTGGTGGCCTTCGCTCAAGCAGAAGGCTGGAGCGTCGA 
CCGCTCCGCAGGCGGCCACTTGAAGCTCAGCAAGATCGGCTGCGCCTCGATCTTCATTTCTTCCACGCCAAGCGACGCAC 
G CGG CG AG CT C AATGCC CG CGC C CTGCTCCG TCG AG CCG AC AGG CAGCGTTCC CTG AAC CAGG AGTCTTTCTGA 

Protein sequence: (SEQ ID NO: 206) 

MRSFLRGARESVRRLVAFAQAEGWSVDRSAGGHLKLSKIGCASIFISSTPSDARGELNARALLRRADRQRSLNQESF 
RL106 

-DNA Sequence : HSEQ ID NO: 85) - - ----- — , - - - _ 

ATGGCTGACGTCACAGCCTACCGGCCGCTCGAGCACTTCCAGAAAGTCGAGCTGATGCTTGAGCTCAAGTTGCGTGAAGG 
TCCTTCGTGGATCTGTCTGAACTGCGGCTATCACCTGGATGGCAGCGGCGCACAGCCCTGCCCTGACTGCGGAAAGTCGC 
GCTACTG G ACCAG CGG TTG GAG TG T AGGTCGTGG C C ATCG CTTCTCGGCAGC AAGGG AAGAGTGGG AAAAC CGCCTCAGG 

ACACGGTCGCGGTCACCTGTCGCGTCAACGGCACCAGTAGCAACTGACGACGTATGCACTCAACTGCGCACAGAGGTCCG 

CATGCTGCGTTCCGCGCATGACGACCTGGCCTGCAGCCGGCAGAGCGATCGTCGCAGCCTTCAGGCGCTGGTGAAACGTC 

TCCTGGATGCCGCCGCCACCGATAGCCTTCCCCGCTCCCTTGCAGAGATGGAGACCTGGCTGCAGCTCAACAGCGAGGAG 
ACCACGAATGCGTAG 

Protein sequence: (SEQ ID NO: 207) 

MPDVTAYRPLEHFQKVELMLELKLREGPSWICLNCGYHLDGSGAQPCPDCGKSRYWTSGWSVGRGHRFSAAREEWENRLR 

TRSRSPVASTAPVATDDVCTQLRTEVRMLRSAHDDLACSRQSDRRSLQALVKRLLDAAATDSLPRSLAEMETWLQLNSEE 
TTNA. 

RLtOj 

DNA sequence: (SEQ ID NO: 86) 

ATGAAGGCGTCCCAGACCTATCAGTGCATCGTCAAGTTCGATGGCGCCGGTTTCTGGACCAATACCATTCAGAAGCAGCG 
TGCGACCTGCACCTGGAGCGACAAGGTGGCAGCCTCCCGCCTTGCCGAACGACTGTTTGGCGAGGACAACGCATACATCA 
CCCGTATGCCGGT AC AGG CAGG CG AC CACGAAAAG CG C AT CG AG AG CCG CTGGGCG CTGTCCTGTAG AAATCC CAAGGAG 
GTAGCG CGCG ATGC CTGA 

Protein sequence (SEQ ID NO: 208) 

MKASQTYQCIVKFDGAGFWTNTIQKQRATCTWSDKVAASRLAERLFGEDNAYITRMPVQAGDHEKRIESRWALSCRNPKE 
VARDA 

Riip8 

DNA sequence: (SEQ ID NO: 87) 

ATGAACACTGAAGCCCGCTTTCCGAGTATCCACGCCTCGGCCGCGTTCACCGACTCGGCAGTGGTTCATGCCAATCACGT 
TGGGGTCAACCCCATCGAGCTGGACGCCCTCAGCCAAGTGATCTCGCGCCTTTCGCGGGACGAGAGCACGGTCGCACCCA 
GTTCGATGGAGCGAGAGCTTCGTGAGCTGGAGGAACTGGGGTACATCGAAATCTCGACCACCCAGGCCGGGACTCTGGTG 
GTCACTACGCGCGCTCCGGGGCAATTGCTTTCGGCTTACTTCTGGTCGGTATGGATCCCGCGACACCTGTTCAGCTGCTC 
GCTGAAAGTGAGCCTGGTGCCGCACCTCTGCTGCGGCACTCAGGACTCCCAGCACCTCACCGCCGTGTTCCGCATTGCAG 
GC AG C AAG G A CG CCG CG CGCG AGTTCCTGCATCAGTTGG C CAACAACTATC C CGGGC ATG AGC CGG AGTTGCCCG AACTG 
GTGGCCGTTCAGGTCGGTGATGCACTCAGCAAGGAGGCCGAGTCATGA 

Protein sequence: (SEQ ID NO: 209) 
MNTEARFPSIHASAAFTDSAVVHANHVGVNPIELDALSQV^ 

VTTRAPGQLLSAYFWSWIPRHLFSCSLKVSLVPHLCCGTQDSQHLTAVFRIAGSKDAAREFLHQLANNYPGHEPELPEL 
VAVQVGDALSKEAES . 
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DNA sequence: (SEQ ID NO: 88) 

GTTTCTGC AG AT C AGCCGGG AGG ACT CG AAG TG AGTGTTCTGG AACTT ACG CCGCCG C ACTC CG TCG AGG CGG AGCAAGG 
GGT ACT CGGCGG C CTG ATG CTGG ACAACG CGG CATGGG ACATTGTCGG CG ATC AGTTGC AG AAGG AGGACTTCTTCCGGC 
ATGAGCATCGGCTGATCTTCACCGCCATCAGCGAGTTGGCCGCGAAGGATGCTCCGTTTGATGTCGTGACTGTGTCGGAA 
GCGATCGAAGACCTTCCAGAAGCTGGCGGGCTGGCCTACCTCGGCCAGCTCGCCGACAACACGCCCTCCGTGGCCAATAT 
CGAGGCTTACGCGCAGATCGTTCGCGATCGGGCACACCTGCGGCAGCTGATGTCTCTCGGGCACCACTGCACCAGGACCG 
CCTCGAACCACCAGGCAAATCCCTCTGAGGTTCAGGAGGAGATTGAGCAGAAGCTGTTCGGCCCTTGGCCAGGACCACCA 
CAACGCCGATTTCGTCGATATCAACAAGAGTCTCACGAAGATCGTCGACACCATCGATTACCGCTTCAACAACAACGTGA 
CGGTAACGGGGGTCCCGACTGGCCTGAAGGATCTCGACGCACTCACCGGCGGACTACAGAAGTCGGATCTCATCATCGTC 
GGTGCCCGCCCCGCGATGGGCAAAACGTCGTTTGCCCTCAACCTGGTCGACACCGCGCTCCAGAGCGACCAACAGAAGTC 
TGTTCAGGTGTACAGCATGGAGATGCCGGCAGAGCAGTTGCTGTTCAGGCTTGCCGCCCTGTTCGGCCACCTGGACCTGG 
GCAAGCTGATGAAGGGCCAACTGCAAGAAGAGGATTGGCCCAGACTGTCTGGCGCGATCCAGCGCATAAACGACTATGGC 
AGCCGGCTGGTCATCAACGATCAGGGCAACCTCACGCCGACAGAGCTGCGCGCCAAGGTTCGCCGGGCGGCCAGGAAGTA 
CGGACACCCCGCGCTGATATTGGTCGACTACCTGCAACTGATGAGTTGCCCAGGCCTGGAGAATCGAGCCACCGAGATCT 
CGGAAATCTCCCGCTCGCTGAAAGCGCTGGCCAAGGAGATGGACTGTCCCGTCGTAGCTCTATCCCAGCTAAATCGCGGC 
CT AG AG AACCGG ACG AAC AAG CG AC CG AACTG CG CG GACCTACG AG AG AGCGG CGC AAT CG AG C AGG ACG CGG ACGTG AT 
CATGTTCGTGTACCGCGACGAGGTCTACCACCCCAACACCGAGGCCAAGGGCATCGCCGAAATCATCATCGGCAAGTATC 

GCAACGGTCCGATCGGCACCGTCCACACCGCCTTCATCGCCAACCAGACCCGCTTTGCCGACCTGGCGCCGGGGACCTGG 

CAA " • ' - '• ...... 



RL110 

DNA sequence: (SEQ ID NO: 89) 

ATGACTCGCTCTGCTCTCTCGACCATCGCCTACGAGGCCCTGGTGCGTGCCCGCCGCAAATTCAGCAACCGAdAGGAGCG 
CTGCATCCGCGAAACCTGGACCGCCGAACAGGAACTGGTGCTGCTGCGCCTGTATCCGGATATGCCGAACGAGGTCCTGG 
CAGCCAGGTTGAACAAAACGCTCCAGCAGATCTGCTCCAGAGCGTATCGGCTCGGGCTGAAAAAAAGCCCTGAGTTCTCC 
AAGAAGATCCGGCAGGACTGGGGCAGCGCAACTCGGTTCAAGAAGGGAAACACCCCATGGAACTGCGGCATGAAGGGGCT 
GCCCGCGCGAGGACGCGCACCAGAAACGCAGTTCAAGAAGGGGCAAAAGCCCCACACATGGCTCCCAGTCGGCAGCACGC 
GGG T CAG CG CTG ATGG CT AC CTG CAACG AAAG ATCT CGG AT ACCGGCT ATCC C CCCCGGG ACTGG AAGGG CATCCACATC 
CTGCTCTGGGAAGAACACTTCGGCCCCATCCCAACCGGCCATTGCGTCTGCTTCAAGGACAACAACAAGCAGAACGTCGT 
CATCGACAACCTGGAGCTCATCACCCGGGCCGAACGCATGCGCCGCAACTCCATCCATCGCTATCCACCTGAGCTGAAGA 
GCGCAATCCGCGTCATCAGCAAGCTCAAACGCACCATTCAGGAGGTCGAGCATGAAGAACAAGATTGA 

Protein sequence: (SEQ ID NO: 210) 
MTRSALSTIAYEALVRARRKFSNREERCIRETWTAEQELVLLRLYP 

KKIRQDWGSATRFKKGNTPWNCGMKGLPARGRAPETQFKKGQKPHTWLPVGSTRVSADGYLQRKISDTGYPPRDWKGIHI 
LLWEEHFGPIPTGHCVCFKDNNKQNWIDNLELITRAERMRRNSIHRYPPELKSAIRVISKLKRTIQEVEHEEQD 

DNA sequence: (SEQ ID NO: 90) 

ATGGACAAGCAAAAAGTCCTCGCCAAGGTCGAGAAGCTGATGGCCCTGGCGAATGCCAAGGGGGCCACGCCGAACGAGGC 
GGAAACCGCATTGCGCCAGGCCGCGATCCTGAAGCGGCAGTTCGACCTCAGCGATGCGGAGATCTCGGCCCACACGGTGG 
AAAC CG CGTG CGTTC CC ACT CG AACC AGG CG CTCTCCTG CC C CATGG CTG C ATG AACTGGC CGGGAT CTG CGCC AGTTCC 

TTCGGCTGCGACTACCTGGCGGCATACGCGATGCCAGCGGGCTGGACGTTCAAGTTCATGGGCCGAGGGATCGGCCCTGA 
GCTGGCCGCTCACGCCTACTCTACGCTCCACCACCAACTGGTGGCAGCGCGCTCGGCTCATGTCGCCCAACAGT^AGCGCT 
GCAAGCTGTCGACCAAGCGTCGTCGCAGCAAGCTCTTCGTCGAAGGCTGGCTTCTCGCAGTGCGTTCGCTGGTACGTGAA 
TTTGCTGGCAGGCCGGACGAGTCGACTCAAGCAGCCATCAAGGCCTACCTCGAACTACACCATCCGGCGTTGAAGTACCT 
GGAGCCGGCGGCGCTTACGAAGGCCCTTGCCTATGACCAGGCCTCGCTGCAAGCAGGCTGGGAGCACGGCAAAAACACTC 
GCCTGCACCGCGGTGTCAGCCGGCGAGTTCAGGGCGCGCTCGAGCAGGGAGGTTCCCAATGA 

Protein sequence: (SEQ ID NO: 211) 
MDKQKVLAKVEKLMALANAKGATPNEAETALRQAAILKRQFDL^ 

FGCDYLAAYAMPAGWTFKFMGRGIGPEIAAHAYSTLHHQLVAARSAHVAQQKRCKLST^RRRSKLFVEGWLLAVRS 
FAG R PDE S TQ AAI KA Y LELHH P ALK YLE P AALT KALAYDQA S LQAGWEHG KNTRLHRG VSRR VQGALEQGG SQ 
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RL112 

DNA sequence: (SEQ ID NO: 91) 

ATGAGTGACCCCAAGCTCAAGCCCTGCCCGCTCTGCGGCAGCACGAACATTCGAATGCTGGAACCCGAGCTGCTCGACAC 
CGATGCCTGGAACTGTGCCATTGAATGCCTGGACTGCCAGGTTCACATCGGGCCGTCCTACTGCGAGCCAGACCCGGTAA 
CAGCGAGGTATTCAGCACAGATCGACTGGAATAGACGCCCAAGCGCAAAAAACCACGCGGACGAGCGTGAGCAGTTCTTG 
ATGGCCAACCTGCTCGCCGCCCTGGAGGTCGCACTGGGCGACGTAGCAGCCCTGGCTATTGTCGATCGGGTAAGACAGGC 
CACAGACCGAATTTACCCAACTTCGAACCTCTCCCCTGTTCCGCAGGCCTGGCTCGATGTACAGGCCGAGCGCCGGCGCC 
AGATCACCGTCGAAGGTTTCGATACCAGCAACGACGACGCTAGCGCTGGCCTGATCGCCCTGGCGGCCGGCTGCTACGCG 
CTCCATGCCGGCGGCATCGGCACCGACTGGCCGGGCGGCATTCGGAATGGCTCTGCACTGTTCTGGCCCTGGGACGAAGA 
GTGGTGGAAGCCTAAGTCGGCGCGCGAGAACCTGGTACGCGCCGGCGCCCTAGTGCTGGCCGAGATCGAGCGCCTGGACC 
GCTCCGCCACCGAGCAGGGCTCAACCATCTGCAAGGGGGGCGCGTAA 

Protein sequence: (SEQ ID NO: 212) 

MSDPKLKPCPLCGSTNIRMLEPELLDTDAWNCAIECLDCQVHIGPSYCEPDPVTARYSAQIDWNRRPSAKNHADEREQFL 
MANLLAALE VALGDVAALAI VDR VRQATDRI YPTSNLS P VPQAWLDVQAERRRQI TVEGFDTSNDDAS AGLI ALAAGCYA 
LHAGGIGTDWPGGIRNGSALFWPWDEEWWKPKSARENLVRAGALVLAEIERLDRSATEQGSTICKGGA 

RL1|:3 - 

DNA sequence: (SEQ ID NO: 92) 

ATGAACCTCCAGAACCGCAACAACCTCCTACTGAGCTTGATCGCCGAGACCCAGTTCGACGCCTACGTGCAAGGCTACAT 

GGCCAAAGCAGGCGCTGCCGCCGGTGCTTCCGAGAATCTGCAAATCGAGGCTGAAGGTGCTGCGATGTTGCAGGGCCTGG 

TCGCTCCGGTTCGCGCTCAGCAGCGTGCCTGTGGACAGTCCCTGCAGAACGCACTGCTCCAAATCGCCCACGACCTACTG 

TTGCAGACGAAATCGCAACTGGCCATCGCGGCCAATGCCAGTTCGATCCAAGTGATCCAGCGGGACATGAACAGGGCGAT 

CTGGAACATAGCTACTGCCATCGATCACCTGGCCGAGTTCGCCCAACCCTCGCAGGACACTGTGAGGGTCATCGAACGGC 

TGATGCTCTTCGTCGGCAGCTCATCAAGCACTGAAGGCCAGCAACTGGCCGCCGAGGCAAATGCGGTGCTCGGCATGAGC 
G TGGGAGG CCTGG CATGA 

Protein sequence: (SEQ ID NO: 213) 

MNLQNRNNLLLSLIAETQFDAYVQGYMAKAGAAAGASENLQIEAEGAAMLQGLVAPVRAQQRACGQSLQNALLQIAHDLL 
LQT^QIAIAANASSIQVIQRDMNRAIVWIATAIDH^ 

rlm* 

DNA sequence: (SEQ ID NO: 93) 

CTGAACAAGTTCGGCAGCGCCGCCGACCTTCGGAGCCAGCAGGCCAAATTGACCGGCGCTACGCGAGAAATACGCAAGCT 
G ACTGGTGG CGG T ATCG AC CTG TTCG GG AAG CTGGGTTGCTACTTG AG CTTCG AAC AAAAGCAGCTCCT AC AAG ACGC AG 
CGCGCTTGCTCGACTCGGTGAACAAGCAGATCGAGCATGCGAAGGAAAAGCGTGATCGCTACGAGAAAAAAGCCAAGAAG 
CGGCGCGAGCTACGTGAGCGCCTGGCCAAGCAACTGGTCGCCTCGAACTACCCGCTTCCGGGAAATACGCTCGAAGATCG 
GCTGGAAATCCTGCAGATCGCGTTGATCTACAACCGGGCCAGGGTGTTCGATCACCTGTACTCCACGCACCAGCTCCACT 
CAAAACTCAAACGCTGGCTGGAGCGTCCAAAGCAGCTCATCGGATGGCGCAGTGAAGCCGAGTATTTCGCTAGTCAGGTG 
GGG AG C CTG CG ATGTG ACTTC ATT AG C CATCTG ACT AACG AAATCG CG T ACG ACG ATGG CAGTG AAGTCG AGG AG CG CCT 
GCGCGTCATCAAGCAGAAGGTCGCTGACTGCACCGCACAGATCGCTCTGACCAGCGAGGAGCAGGAAACCCTTCGGCTCT 
GGACAGACGCTCTGCAATCGGCTCCGGAGGGCCTCATATGA 

Protein sequence: (SEQ ID NO: 214) 

LNKFGSAADLRSQQAKLTGATREIRKLTGGGIDLFGKLGCYLSFEQKQLLQDAARLLDSVNKQIEHAKEKRDRYEKKAKK 
RRELRERLAKQLVASNYPLPGNTLEDRLEILQIALIYNRARVFDHLYSTHQLHSKLKRWLERPKQLIGWRSEAEYFASQV 
GSLRCDFI SHLTNEI AYDDGSEVEERLRVI KQKVADCTAQI ALTS EEQETLRLWTDALQ SAP EGLI 



RL115 

DNA sequence: (SEQ ID NO: 94) 

ATG AATG CG AAAGCG ACTTCGGTTG TATC C AC C AAGGGTGGTGT AGG AAAATC CAC C ACCGCCGCCAACCT CGGTGC ATT 
TTGCGCCGATGCAGGCATACGAACCCTCCTCATCGATCTGGACCCCGTCCAGCCCTCCCTATCCTCGTACTACGAGCTGC 
CGGAAGTTGCCCAGGGCGGCATTTACGACCTGCTCGCCGCCAACATAACGGACCCGGCGAGGATCATCTCCAGGACGATT 
ATCCCCAATCTGGACGTCGTGATTTCCAACGACCAGAACAATCAGCTCAACAACCTACTGCTCCAGGCGCCCGATGGCCG 
GCTACGCCTGGCGAACCTGATGCCCGCTCTGAAAGAAGGCTACGACCTGGTGCTGATCGACACCCAGGGTGCGCGCTCAG 
CTTTGCTCGAAATGGTTGTGCTTGCATCGGACCTGGTTGTTTCCCCCCTCCAACCCAACATGCTTACCGCCCGTGAGTTC 
AACCGCGGCACCATGCAAATGCTCGACGGCCTACGCCCCTATGAGCGTCTCGGCATGCGGATCCCCAATGTTCAGATCGT 
CATCAACTGCCTGGACCAGACCAATGACTCCCGGGCAATTCACGAGAATGTGCGTGCCATCTTCGATGAGCATCAGGACA 
TTTCTGTGCTCGAAACGACTGTCCCGGATGCCGTCGTGTTTCGCAACGCAGCATCGCGCGGGCTACCAGCGCACCGCCTC 
GAAACGCGGCAACCCTCCAATCGCACATCAGCGCCCGCGCTGGAAATCATTCGAAACCTGGCCATCGAGGTCTTTCCCGA 
GTGGACTGACCGCTTCCTGGCGCTGACGCCGGGAGGCGGTTGCAGCACTGGTCAAGGGAGGGCGCTGACATGGCGAAGAC 
TCCTATCACCCAAGCCCGCGACGTCGACGCGGAACTTGTGCTGGAACTGA 
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Protein sequence: (SEQ ID NO: 215) 

MNAKATSWSTKGGVGKSTTAANUSAFCADAGIRTLLIDLDPVQPSLSSYYELPEVAQGGIYDLIJU^ITDPARIISRTI 

IFNLDWISNDQm^QLNNLLLQA 

NRGTMQMLDGLRPYERIX3MRIPNVQIVINCLDQTND 

ETRQPSNRTSAPALEIIRNLAIEVFPEWTDRFLALTPGGGCSTGQGRALTWRRLLSPKPATSTRNLCWN 
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RS01 

DNA sequence: (SEQ ID NO: 95) 

ATGGGGATCTACCGCCCGAAGCAGTCTCGCCTAGCGATACCGATACTGAGGGGCCGGCTACCGGACGAAAGGTAGCTGCG 
CCTCCCAGCAGTTCGCTAGGCCTGTAGGATkAAATCTGGAATTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCA 
GAGCCCCGCAATTTCAAGGCCGAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGAGGGCACATTGCTACGCCTGCA 
GAATGGTTTCAGGGCCTGAAAAACAGAAAAGCCCACCTAAATAGGCGGGCTATTCCATATTGACATCACGTCAATGCGGG 
CC 



RS02 

DNA sequence: (SEQ ID NO: 96) 

--ATGAGGCGGCAGCAGCTCACCGAGGAGTACATCTTCGCGCACGATCTCCGAGAAGCCAGCGCGAAGATCTACCGCGCCGC 
GACCAAGGCGCTGCTCAAGCACTTCGGCCCTACGGCAACCGTACAGGAGGTGGACCACAGGTCTGTGCTGGGATGGCGGC 
GCAAGGTCCTGGAACAAGGCCTGTCGAAGCGGAGCTGGAACACGTATTCGAATCATCTGCGAACGATCTGGGGCTATGCC 
ATCGAGCACGAGCTGGTGACACACTCCCAAGTCAACCCGTTCAGAAAGACCACCGTCATCCCCCCCAGGCGAGCAAGCAA 
AACCGTCGCAGCCGAAGCCATCCTGCGCGCCCGCAATTGGCTCAACATGCAGGTCGGCGCCGAGCGCTGCACTGGCGATC 
GCGCACGCATCACTCCCGCCTGGTTCTGGCTTTGCACGTTTGAGGTCTTCTACTTCACCGGCATCCGGTTGAATGCGCTG 
TTGTGCATCCGCAAGCGCGACATCGACTGGGAAAATCAACTGATCCTCATCCGCGGCGAGACAGAGAAAACTCATAAAGA 
GTTCGTAG TG CC AAT AACG G AGG GGCTTG TG C CTCA C CT AT CGCGGCTC CTG CAGG AGGCCG ATAGAG CCGG ATTCGCCG 
ATGACGACCAGTTGTTCAACGTCAACCGGTTCTCACCGCACTACAAGAGCAAGGTGATGAACTCCGACCAGGTCGAAGCC 
ATGTACCGGAAGTTGACCGAGAAGGTTGGGGTGCGGATGACTCCGCACCGTTTCCGGCACACCCTGGCCACCGACTTGAT 
GAAGGCACCCGAGCGGAACATCCACCTCACG7VAGTGCCTGCTCAACCACTCGAATATCCAGACCACCATGAGCTACATCG 
AGGCCGACTACGACCACATGCGTGCCGTGCTGCATGCCAGAAGCCTGGCCCAAGGAGCGCTGGAGAACGTCAGGAAGGTG 
GATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCATGCGGGCAACCTCTCGCTCGAATGGGTGAAGCGCCGCC 
ACAGGAGGCTAGGACAGAACCTGCAGAACCAAGGGAGCACACACCAGGGACAGGCATTCAGGGAGATGCAACCGCGTGGG 
AAGAAGCGCTACCACAGCCACCTGACACCTTCGAGCAAAGCGTGCTGTTCACTCTGATGGCTCAACACCTATCGAACCGT 
GCCGCCACGGCCTCCGCGGCTTCCACCGCAACAAGCGGATCTGGAGGATGGGGATCTACCGCCCGAAGCAGTCTCGCCTA 
G 



Protein sequence: (seq id no: 216) 

MTPQQLTEEYIFAHDLREASAKIYRAATKALLKHFGPTATVQEVDHRSVl^WRRKVLEQGLSKRSWNTYSNHLRTIWGYA 
IEHELVTHSQWPFRKTTVIPPRRASKTVAAEAILRARNWLNMQVGAERCTGDRARITPAWFWIiCTFEVFYFTGIRLNAL 
LCIRKRDIDWENQLILIRGETEKTHKEFVVPITEGLVPHLSRLLQEADRAGFADDDQLFNVNRFSPHYKSKVMNSDQV^ 
MYRKLTEKVGVRMTPHRFRHTLATDLMKAPERNIHL^ 

DYSGSPQASAKPKPCGQPLARMGEAPPQEARTEPAEPREHTPGTGIQGDATAWEEALPQPPDTFEQSVLFTLMAQHLSNR 
AATASAASTATSGSGGWGSTARSSLA . 

RS03 

DNA sequence: (SEQ ID NO: 97) 

ATGAAATCTGGTATCGCGACCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTCGACGGGACCGCCAT 
GCTGGTCACGCCAGGAATCTTCAAGCGTTATGTCCAGGAGCATCCGGAGGTTGAAAAGCTGGCCCAGGCCAAGGAGACCG 
CCGG CTGG AAG CTGGTG CAG CG CG CGTT CG AG AAACAGGG TCTTC ACCG AAAG ACCAGTAAG AACCTGAATATCTGG ACC 
ATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGTGCA 
GCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Protein sequence: <seq id no-. 217) 

MKSGIATRRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPE 
IKVSGPRKTKELKAYLLQDPKLLFPVQPLDNPSLTVITDAEGGVE 
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RS04 



DNA sequence: (SEQ ID NO: 98) 

AT AGAC C AGTTG AGTGAGCAGGAGTCGGTGGAAGTGGTCTGCT CAGCTTT CGATGTGGCGCGGT CTTGCT ACT ACGT C CA 
CCGTCTTCGACGGCGGCGTGTCGATGCTCGCCGCGTGGCGCTACGCAGCCAAGTCAACCAGTTGTTCAGCCAGAGTCGGG 
GCTCGGCCGGCAGCCGCAGCATTCTGGGCATGCTGCGCGAAGAGGGCGTGACCATCGGCCGTTTCCGAGTGCGTCGGTTG 
ATGCGTGAGCTGGGCCTGGTCAGCAAGCAACCGGGCTCGCACGCCTACAAACAGGCCACGGTTGAGCGGCCGGATATCCC 
GAATCGGCTGAACCGCGAATTCGCGACCGAGCATCCCATACAGGTGTGGTGTGGCGACATCACCTACGTCTGGGCGCAAG 
GCCGTTGGCACTACCTGGCCGCGGTGCTGGATCTGCTGATCGGCTGGGCGTTCTCGGCCAAGCCGGATGCCGAACTGGTG 
ATCAAGGCC CTGG AC ATGGC C T ACGAAC AGCGCGGC AGGC C AC AGC AGGTGCTGTTCC ATT CAG AC C AGGG C AGC C AGTA 
CGCCAGCCGCCTGTTTCGGCAACGGCTCTGGCGCTATCGGATGCAGCAGAGCATGAGCCGTCGGGGGAATTGCTGGGATA 
ACTCGCCGATGGAGCGCCTGTTCCGCAGTCTGAAGTCGGAGTGGGTCCCGTCAACGGGTTACCTGACGGCGCAGGAGGCC 
C AACGGG AC A TC AGT C ATT ACTTG ATG C AC CG C TAC AACTGG AT CAGGC CGC ATC AATTC AACGACGGGTT ACCACCTGC 
GGTGGCCGAAGAAAAACTCAACCCACTGTCCGGGATGGGTTGA 

Protein sequence: (SEQ ID NO: 218) 

I DQL S E QE S VE WCS A FD VARS C Y Y VH RL RRRR VD ARR VAL RS Q VNQLF S QS RGS AG S RS I LGMLRE EG VT I GR F RVRRL 
MRELGLVSKQPGSHAYKQATVERPDIPNRLNREFATEHPIQVWCGDITYVWAQGRWHYLAAVLDLLIGWAFSAKPDAELV 
IKALDMAYEQRGRPQQVLFHSDQGSQYASRLFRQRLWRYRMQQSMSRRGNCWDNSPMERLFRSLKSEWVPSTGYLTAQEA 
QRD I S H YLMH RYNW I R PHQFNDGL P PA VAE EKLNPL SGMG 

RS05 

DNA sequence: (SEQ ID NO: 282) 

ATGAGCAAGCAACGACGTACGTTTTCCGCCGAGTTCAAACGAGAGGCCGCGGCCCTGGTGTTGGACCAAGGCTACAGCCA 
TATCGACGCCTGCCGTTCGCTGGGGGTGGTGGATTCGGCCTTGCGCCGTTGGGTGAAGCAGCTCGAGGCGGAGCGCCAGG 
GTGTGACCCCGAAGAGCAAGGCGTTGACGCCTGAGCAGCAAAAGATCCAGGAGCTGGAAGCCCGGATCAACCGATTGGAG 
CGGGAGAAAGCGATATTAAAAAAGGCTACCGCTCTCTTGATGTCGGACGAACTCGATCGTACGCGCTGA 

Protein sequence: (seq id no: 219) 

MSKQRRTFSAEFKREAAALVLDQGYSHIDACRSLGWDSALRRWVKQLEAERQGVTPKSKALTPEQQKIQELEARINRLE 
REKAILKKATALLMSDELDRTR 

RS06 

DNA sequence: (SEQ ID NO: 99) 

ATGTTGTATTTTTCTTGCAGTATGAAGATGGGTGGTTGGGTCGGATATAGGTACTTCTCTCTATTTTCTTTAATTGCTCT 
CATCTATGGGTGTGTCGGTGGTGGAGGTGGATCGGATGAGATTGGGCAGCACTGCTTTGAGAGAGAGCAAAAGCTTTCCG 
GAGTTAATGATAATGAAGAGGGGAGTGTGAGGTTGAATCGGCTGAACTGCGATCCAATTGAAGGTCGTGTTCTTGAATCA 
GAGAAGCTGATAAGAAAGCCGCCCAATGAGCTGGGTATTCACTGA 

Protein sequence: (seq id no: 220) 

MLYFSCSMKMGGWVGYRYFSLFSLIALIYGCVGGGGGSDEIGQHCFEREQKLSGVNDNEEGSVRLNRLNCDPIEGRVLES 
EKLIRKPPNELGIH 

RS07 



DNA sequence: (SEQ ID NO: 100) 

ATGAAAAAATCACTTGTTATGTCGGCTGTGCTTTTGGTGGCTAGCAATTTCGCGTGTGCTGATGAGGGCTCAAATGATGG 
AAGTGAGATATGTCGGGCGCAGGGTGGAGTTGAAATAACAAGTCTGGGGGAAGTCTCAAAGGGTGTGGATGTTGAAGATG 
TTGTAGTTTGTTCGATTCTTCCAAGTAATATGAAGTCGAGTCAAAGAGCGCCTACACTCCCTCCTCTGCAAAGGATGATC 
ATTTCGGCAATGCCTTCACCAGGAACGGTCACTGTTTCTGCCAGCGGAGATAGGAAATTTACAACATCTTGCCGGGCAAA 
TCTTTATGCTCCACGTTATGCCAATTTCTATCCAGACGGTGTTAGCAGGGGAACATCAGATCTACGATGTGTTGGTTACA 
ATACACCCGGGAATTCATCTCAAGGGTGTAATGTGTCATGGGACGGCCCGACCGACATTCAATTGGGTGTTGAGCCATAT 
GGCGGATCTGTTGTTGTAAACTACAGTTGCACTGCATTCAAAACAACGATTCCAGTGATAATGAGCTACAGTTATCGTGA 
TGGG CGGGC AGTGT ATGGCG AGGT C CAG AATGTGT CAGG AAT AATAAATGTGGTT TTG AACTAA 
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Protein sequence: (seq id no: 221) 

MKKS LVMS AVLLV AS N F ACADEG S NDG S E I C RAQGG VE I T S 1X3 E V S KG VD VED VWCS I LP SNMKS S QRAPTLP PLQRMI 
ISAMPSPGTVTVSASGDRKFTTSCRANLYAPRyANFYPDGVSRGTSDLRCVGYNTPGNSSQGCNVSWDGPTDIQLGVEPY 
GGSWVNYSCTAFKTTIPVIMSYSYRDGRAVYGEVQNVSGIINWLN 

RS08 

DNA sequence: (SEQ ID NO: 101) 

ATGCTTATTAAAATTCTTCGAATTATATTCTTGTTGCCTATAGTTGGTTTGGCACAGCAGGCTGCTGCCTCCCCGCCCGC 
AGAGTCACACTCGGAACAATCTGAATCTTCGTGTATCGATGTCCAAGTCAATGGAGCACGTAGCCTGTCTTATAACTGCA 
TGGCTC AG C AAATG ACT CCACC CAAAG AGG ATCCTCGG CGTCGG AAC CCTAC CTTG AACTCCACATT AG CGTCTG AACGC 
GCCACTCGCCTGCCACCCACACAGACAGGACTTTTTACCAGCCTTCATCAACGTGCCATATCGAACTCGAAAGACTAG 

Protein sequence: (seq id no: 222) 

MLIKILRIIFLLPIVGLAQQAAASPPAESHSEQSESSCIDVQVNGARSLSYNCMAQQMTPPKEDPRRRNPTLNSTLASER 
_ . ATRLP PTQTG.LFT S.LHQ^ I SNS KD . . ._ „ _ 

RS09 

DNA sequence: (SEQ ID NO: 102) 

GTGAGTAGTACTAAGAGTAAGCCGATAGCCAGGGGGCGTGGTGGCCCATTTGGGGAAGTGATGAAGAGGTGCGGGCTTGT 
ACCGGTTCGAGGAAGGAATAGACAGCAGACAGGATCGCTTGCGATGGGGCAGCAGGAAACCATCAGCCCGTCCGTATCCA 
GAACTGCTGCTTGCAGCGTTAGGGGTGACTCCCTCATGCCCTAG 

Protein sequence: <seq id no: 223) 

VSSTKSKPIARGRGGPFGEVMKRCGLVPVRGRNRQQTGSLAMGQQETISPSVSRTAACSVRGDSLMP 

RS10 

DNA sequence: (SEQ ID NO: 103) 

ATGGAACGCTTGCTCGAGAGCATTTACATCAATGCCCGGCCGGCGATGGAGTTGAGGCTTAGCCTCACCAGCTCCGGCCG 
CAAGAGAATGGTAAAGATTGTGGATGGGGAGGAGGTCGAGGTTCTGCCAGGTGAAGTGCAGGGCATCCTGGAGGCCCAAA 
AGAGGGATGTTGGAATCCTCGCCGACTTCTTAGCCAAGAGTCTCGTGGCGCGACGCTAG 

Protein sequence: <seq id no: 224) 

MERLLESIYINARPAMELRLSLTSSGRKRMVKIVDGEEVEVLPGEVQGILEAQKRDVGILADFLAKSLV 

RS011 

DNA sequence: (SEQ ID NO: 104) 

ATGGAATGCCACGTTCGTCCCGCCACGAGCAGAGATGCAGCAGCGATAAGCTGCGTAGTTATAGCCGCCCTGCGTGAGTC 
AAATTCACAGGACTATCCGCCTGATGTGATCGCTCAGGTTGAGCAGAGCTTTTCTCCTGAAGCCATCACCACACAGCTTA 
CGAAGCGTAGGGTCTTCGTAGCCTTATTGGGCGAAAACATTATTGGCACTGCCGGTCTCGACGGTGACGTCGTCAGAAGT 
GTTTTCGTTGACCCAGCTCACCAGAAAGGCGGTATCGGGCGGCATTTGATGGATGTCATTCATACAACTGCTGCCAGCGC 
GGGAGTTGGAGCTGTACGTGTGCCATCGTCGATTACAGCTGAAAGGTTTTATACCGCATTGGGTTATCAGAAAATCCGCG 
ACGAGTTTCATGGGGCGGAGCGCACCATCGTTATGGAGAAGCGGCTGTAG 



Protein sequence: (seq id no: 225) 

MECHVRPATSRDAAAISCWIAALRESNSQDYPPDVIAQVEQSFSPEAITTQLTKRRVFVALLGENIIGTAGLDGDVVRS 
VFVDPAHQKGGlGRHLMDVIHTTAASAGVGAVRVPSSlTAERFYTALGYQKJRDEFHGAERTr/MEKRL 
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RS12 

DNA sequence: (SEQ ID NO: 105) 

TTGTGGTTGACCTGCACGCCACAGCAGGATGTGCAGGCGGCGTTAGCTACAGCGTCGATACTCCTGGGCCAGTTCCACCA 
GTTGGGCGTGCAGCTCGGTCGGTACACTAGCCTCGACCCGCTTGAGGAAGTCGAGAAGAACGCTTCTGCACTGCCGTCTC 
CTGCTTGGAAAACGGATTCTACTAAGTTCAGCGTGGTACTGAAATCGGGGGGCAGGTCAATCGACAAAGGTATCCCGACC 
GCAGGTTTGTTGGCCCACGTGATGGTGGCCAAGTTTGCCGATCACTTGCCGCTGTACCGGCAGGAGAAAATCTTTGGCCG 
CGCCGGGCTGGCAATTGCTCGCTCGACCCTGGCGCAGTGGGTCGGACAAACCGGCGTGCGGCTTCAGCCACTGGTCGATG 
CACTGCGTG AAGCCGTG CTG AACC AGGG CG TG AT CC ACG CTG ATG AAACACCGGTGC AAATG CTTG CG CCAGGCG AG AAG 
AAAACCCACCGGGCCTATGTCTGGGCGTACAGCACGACGCCGTTTTCAGGGCTCAAAGCGGTGGTTTACGACTTCAGCCC 
AAGCCGTGCTGGCGAACATGCGCGCAACTTCCTGGGTGACTGGAACGGCAAGCTGGTCTGCGACGACTTCGCTGGCTACA 
AAGCCGGTTTCGAACAAGGCATCACTGAAATCGGCTGCATGGCCCACGCCCGGCGCAAGTTCTTTGATTTGCACGTGGCG 
AACAAAAGTCAGCTGGCTGAACAGGCCCTGCACTCGATCAGCGGCTTGTACGAGGTCGAACGTCAGGCGCGGGACATGAG 
TG ATG AAG AG CG CTGG CG AATACG ACAAG AATTGG CGG TG CCGATC CTC AAAAAACTGCATG ACTGG ATGTTGGCTCAGC 
GAGACCTGGTGCCCAATGGATCAGCCACGGCCAAAGCCCTCGATTACAGCCTGAAACGCTGGGTAGCGCTGACGCGCTAC 
CTGGACGATGGGGCTGTGCCCATCGATAACAATCAGGTCGAGAACCAAATACGGCCATGGGCGCTCGGGCGTTCGAACTG 
GCTGTTTGCCGGGTCGCTGCGCAGTGGTAAACGGGCGGCTGCAATCATGAGCCTGATCTAG 

Protein sequence: <seq id no: 226) 

LWLTCTPQQDVQAALATASILLGQFHQLGVQLGRYTSLDPLEEVEKNASALPSPAWKTDSTKFSWLKSGGRSIDKGIPT 
AGLLAHVMVAKFADHLPLYRQEKIFGRAGIJUARSTIAQWVGQT^ 

KTHRAYWAYSTTPFSGLKAVVYDFSPSRAGEHARNFIXSDWNGKLVCDDFAGYKAGFEQGITEIGCMAHARRK^ 

NKS QLAEQALHS I SG L YE VE RQARDMS D E E R WR I RQE LA VP I LKKLHDWMLAQRDLVPNG S ATAKALD YS LKRWVALTR Y 

LDDGAVPIDNNQVENQIRPWALGRSNWLFAGSLRSGKRAAAIMSLI 

RS13 

DNA sequence: (SEQ ID NO: 106) 

ATGG TGAGGCGG CG G AGGG T CG CGGTGGCG CG CGAATG CCTG AG CCTG TCG AGCGC ACCG AACCAGGTCTTG TCG ATGG A 

TTTCGTCTTCGACGCGCTCAGCACTGGGCGACGGATCAAATGCCTGACGGTGGTCGATGACTTCACCAAGGTGTCGGTCG 

ACATCTTGGTGGAGTACGGTATCAGCGGTTTTCGTGTCACGCGGGCGCTGGACGAGATGGCGCGGTTTCGTGGCTACCCG 

CAGGCGATCCGCACCGACCAGGGCCCCGAGTTCACCGGCAAGGCGCTTGATCAGTGGGCCTGTCAGCGTGACATCAAGTT 

GAAGCTGATTCAGCCTGGCCAGCCCACGCAGAGCGCCTTCATCGAGTCATTCAACGGCAAGTTCCGGGGCGAATGCCTCA 

ATG AG CACTG CT CG CTGGT CG AAGC C AG AATCCGT AT CGCGG CTTGGCGGG ATT AC AACG AGC ACCG ACCAC AC AGCGCC 

ATTGGCAATCTCTCCCCGGCAGAGCTTGCTGCGAAGTGGCGAACCAACCAGCAGCAGCTGAAGCGGGAAAAGTTGATATC 
AACCCCATAG 

Protein sequence: <seq id no: 227) 

MVRRRRVAVARECLSLSSAPNQVLSMDFVFDALSTGRRIKCLTVVDDFTKVSVDILVEYGISGFRVTRALDEMARFRGYP 

QAIRTDQGPEFTGKALDQWACQRDIKLKLIQPGQPTQSAFIESFNGKFRGECLNEHCSLVEARIRIAAWRDYNEHRPHSA 
IGNLSPAELAAKWRTNQQQLKREKLI STP 

RS14 

DNA sequence: (SEQ ID NO: 107) 

ATGCATATCCAATCGTTGGGGGCTACTGCCTCCTCGCTGAATCAGGAGCCTGTCGAAACCCCGTCGCAGGCAGCGCATAA 
GTCCGCCAGCTTGCGTCAGGAACCTTCAGGGCAAGGTCTCGGGGTTGCCCTAAAGAGCACGCCGGGAATACTTTCCGGGA 
AGTTGCCGGAAAGCGTTAGCGACGTGCGTTTCAGCAGTCCCCAAGGGCAAGGGGAGTCCCGTACTCTGACTGACTCGGCA 
GGGCCGCGGCAGATCACTCTGCGCCAGTTTGAGAACGGAGTCACCGAGCTACAGCTCAGTCGGCCACCATTGACCAGTCT 
GGTCCTAAGCGGCGGTGGTGCCAAAGGTGCGGCATACCCGGGAGCAATGCTGGCGCTAGAAGAGAAAGGCATGCTCGATG 
GCATCCGCAGCATGTCCGGTTCGTCCGCTGGCGGCATCACCGCCGCCCTTTTGGCCTCAGGTATGAGCCCGGCGGCGTTC 
AAGACCCTTTCCGACAAGATGGATCTTATTTCGCTGCTCGACAGCTCGAACAAGAAGCTGAAGCTGTTCCAACACATTAG 
CAG CGAG ATCGG CG CATCGCTG AAAAAGGG CTTGGGCAAC AAG ATCGGCGGCTTCTCTGAGTTGCTG CTCAATGT ACTCC 
CACGCATAGATTCGCGGGCTGAGCCCCTAGAACGCCTATTGCGCGACGAGACACGCAAGGCCGTGCTCGGACAGATCGCT 
ACGCATCCAGAGGTTGCACGCCAGCCGACCGTTGCCGCCATCGCCAGCAGATTGCAGTCCGGCTCCGGAGTCACCTTTGG 
CGATCTAGATCGGTTGAGTGCTTACATTCCCCAGATTAAGACGCTGAACATCACAGGTACGGCCATGTTCGAGGGGCGTC 
CGCAATTAGTGGTGTTCAATGCCAGCCACACACCGGATCTGGAGGTCGCCCAGGCGGCACATATCTCCGGTTCCTTCCCA 
GGAGTGTTCCAGAAGGTCAGCTTGAGTGATCAGCCGTACCAGGCCGGCGTAGAGTGGACAGAATTCCAGGATGGCGGGGT 
GATGATTAACGTGCCGGTCCCTGAGATGATCGACAAGAATTTTGACAGCGGGCCACTGCGGCGCAACGACAACCTGATCC 
TTGAGTT CG AGGG CG AAGCTGGGGAGGTAGCG CCCGACCGAGG TACT AGGGGCGG CGCGCTCAAGGG CTGGGTCGTCGGG 
GTGCCTGCCCTGCAGGCGCGCGAAATGCTGCAGCTCGAGGGCCTGGAGGAATTGCGCGAGCAAACCGTTGTGGTGCCGTT 
GAAGAGCGAGCGCGGTGATTTCAGTGGCATGCTCGGTGGCACCTTGAACTTCACCATGCCGGACGAGATCAAGGCGCATC 
TTCAGGAGCGCCTCCAGGAGCGAGTCGGTGAACATCTGGAGAAACGTCTTCAGGCTTCAGAGCGTCATACCTTCGCTTCT 
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FIGURE 31E 

CTCGACGAGGCGCTGCTGGCACTTGATGACAGTATGCTCACCAGTGTTGCTCAACAGAACCCGGAGATCACAGACGGGGC 
GGTGGCTTTTCGCCAGAAGGCGCGGGATGCGTTCACCGAGCTGACTGTCGCTATCGTTAGCGCCAATGGCTTGGCGGGTA 
GGCTCAAGTTGGACGAGGCTATGCGCTCCGCTCTTCAGCGACTCGATGCGCTGGCAGATACTCCGGAACGCCTAGCATGG 
TTGGCAGCTGAGTTGAACCATGCTGATAACGTTGATCATCAGCAGTTACTCGATGCCATGCGCGGGCAGACGGTGCAGTC 
G CCGGTG CTCG CCGCTG CGTT AG C AG AGGCG C AG CG C CG CAAAG TGG CGGTTATTGCCG AG AACATTCGT AAGG AAGTTA 
TCTTCCCCTCTCTGTATCGCCCTGGCCAGCCGGATTCCAACGTAGCTCTGTTACGTCGGGCGGAGGAGCAGCTACGGCAT 
GCCACCAGTCCGGCGGAAATCAATCAAGCGCTGAACGATATCGTCGACAACTACTCGGCACGAGGCTTCCTGCGTTTCGG 
CAAACCCTTGAGTTCGACTACCGTTGAGATGGCTAAGGCTTGGCGGAATAAGGAGTTCACATGATT 

Protein sequence (seq id no:228) 

rWIQSLGATASSLNQEPVETPSQAAHKSASLRQEPSGQGLGVALKSTPGILSGKLPESVSDVRFSSPQGQGESRTLTDSA 
GPRQITLRQFENGVTELQLSRPPLTSLVLSGGGAKGAAYPGAMLALEEKGMLDGIRSMSGSSAGGITAALLASGMSPAAF 
KTLSDKMDLISLLDSSNKKLKLFQHISSEIGASLKKGl^ 

THPEVARQPTVAAIASRLQSGSGVTFGDLDRLSAYIPQIKTLNITGTAMFEGRPQLWFNASHTPDLEVAQAAHISGSFP 
GVFQKVSLSDQPYQAGVEWTEFQDGGVMINVPVPEMIDKNFDSGPLRRNDNLILEFEGEAGEVAPDRGTRGGALKGWWG 
VPALQAREMLQLEGLEELREQTVWPLKSERGDFSGMLGGTLNFTMPDEIKAHLQERLQERVGEHLEKRLQ 
LDEALLALDDSMLTSVAQQNPEITDGAVAFRQKARDAFTELTVAIVSANGLAGRLKLDEAMRSALQRLDALADTPERLAW 

liaae'lnhadnvbhqqlldamrgqtvqs pv1j>mlaeaqrrkvavi aeni rkevi fpslyrpgqpdsnvallrraeeqlrh 
- : Atspae-inqai^dxvdnysargflrfgkplssttvemakawrnkeft 

RS15 

DNA sequence: (SEQ ED NO: 108) 

ATGATTGATACATGGCTGGCACAGTGGGGCTTGAGACTTCCCTCGAGCAACGATGCCACGTTGCGGCTGCAACCGGCAGA 
GGGACCGGAACTGGTTATGGAGCGCCTCGAGGGCGGTTGGCTTTTCGTCGTCGAGTTGGGACTTGTGCCTTCAGGGTTAC 
CGCTGGGTGTGATCTTGCAATTGTTACAAGTGAACTCTCCATTCTCATCCTTGGCACCGGTGAAACTTGCGGCGGACGAT 
G CCGGT AG ACTTGTG CT CTGGGCTG AGG CACGTG ATGGCGTTG ACG ATGTGG ATG CACTG AACCGCTTGCACG ATAGGCT 

GCGGGAAGGACATTCACGATTAGTGCCATTGCTAGAGCCCACGGGTGAGTTGGTTCCAGCTCAGATACAAACCAGCGCGT 
TAGTGTTCGTTTGA 

Protein sequence: (seq id no: 229) 

midtwiaqwglrlpss^atlrlqpaegpelvmerleggwlf 

agrlvlwaeardgvddvdalnrlhdrlreghsrlvplleptgelvpaqiqtsalvfv 
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Big Island: Overall Nucleotide Homology 
Total 84830bp 

Bp # Species, strain, gene name Accession # Evalue 

/ %identity 

23559-25465: X. axonopodis pv. Citri strain 306 AE011864 83% 

Only stretches within the 20437-25465bp are homologous to X. axonopodis pv. Citri strain 306. 
A total of 1 060bp, not contiguous, from this region are homologous to X. axonopodis. 

33872-38412: P. aeruginosa, PA14, pvrR AF482691 0.0; 99% 

40989-46535: P. aeruginosa, PA01, PA2128-2132 AE004640 80% 

Only stretches within the 40989-46535bp region are homologous to PA01. A total of 2406bp, not 
contiguous, from this region are homologous to PA01. 

48266-49533: P. putida, plasmid pWWO AJ344068 96% 

Only stretches within the 48266-49533bp are homologous to P. putida, plasmid pWWO. A total 
of 780bp, not contiguous, from this region are homologous to P. putida. 



56824-58706: P. syringae pv. maculicola, plasmid pFKN AF359557 83% 

Only stretches within the 56824-58706bp are homologous to P. syringae pv. maculicola, plasmid 
pFKN. A total of 1 882bp, not contiguous, from this region are homologous to P. syringae. 

64748-64942: P. aeruginosa, PA103, exoU, U97065 lE-85/96% 

82447-85179: P. aeruginosa, PA01, PA0984-0985 AE004531 0.0/97% 

85334-855542: 3E-80/94% : 
93200-93317: P. aeruginosa, PA 158 X73064 7E-50/98% 



108075-108610: P. aeruginosa, SG17M, plasmid pKLCl02 AF285416 0.0/91% 
100119-101054: P. aeruginosa, PA01, PA3849, AE4802 0.0/98% 
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Small Island: Overall Nucleotide Homology 
Total 10848bp 



Bp# 



(Species, strain, gene name 



Accession #) 



E value 
/ %identity 



AE004531 
AP285416 
AE004531 



877-1 632: P. aeruginosa, PA01 , PA0977 

877-2603: P. aeruginosa, SG17M plasmid pKLCl 02 
2978-6471: P. aeruginosa,? A0\,?A0m-S\ <> ^™ Ji 
7035-7999: P. syringae pv. maculicola, plasmid-pEKN .,- AF359557.7 - 

Only stretches within the 7035-7999 bp are homologous to 11 " - 

A total of 534bp, not contiguous, from this 



0.0/94% 
0.0/92% 
0.0/99% 
83% 

P. syringae pv. maculicola, plasmid pFKN. 



7999- 8284 

8000- 8080 
8120-8259 
8272-8860 
8470-11724 



region are homologous to P. syringae. 
P. aeruginosa, PA 103, exoU, AF27291 
P. aeruginosa,? A01 intragenic region AE004531 
P. aeruginosa,? A01 intragenic region AE004531 
P. aeruginosa, PA01 intragenic region AE004531 
P. aeruginosa, PA 1 03 , exoU, AF2729 1 



E-l 36/96 
E- 18/91% 
E-24/85% 
E-l 76/88% 
0.0/99% 
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Alignment: NoJ2 - embl|AL039136|HSM003612 

• I 

Homo sapiens mRNA; EST DKFZp566K094 j1 (from clone DKFZp566) 



18 DOTCDNLSQNPPHHLLLRLLDHWGDPAGCWSLGQTYSGHLYLPYCRELHKCSLCAHRWH SEQ ID NO: 230 
D0TCDNLSONPPHHLLLRLLDHWGDPAGCWSLGQTYSGHLYLPYCRELHKCSLCAHRNMH SEQ ID NO: 231 
2 9 DOTCDNLSQNPPHHLLLRLLDHWGDPAGCMSLGQTYSGHLYLPYCRELHKCSLCAHRNWH SEQ ID NO: 232 

HYCCLW PVWMLC YMSW 9 3 

HYCCLWPVWMLCYMSW 

HYCCLW PVWMLCYMSW 256 
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Alignment: No_8 - embl|M79137|HSXT01285 



EST01285 Subtracted Hippocampus, Stratagene (cat. #936205) H 

Q: 18 0V0HPPLCLLD0H00EC1PPCLPPDHLQDPQHPFLLPDHHVPHLVVL10PQLCRALAP SEQ ID NO' 233 
75 

QVQHP .CLLDOHOOECIPPCLPPDHLODPQHPFLLPDHHVP.LVVLIQPQLCRALAP; SEQ ID NO: 234 

H: 43 QVQHPXXCLLDQHQQEC1 PPCLPPDHLQDPQHPFLLPDHHVPXLVVL1 QPQLCRALAp! SEQ ID NO* 235 

216 • ! . 
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Alignment: No47 - swissnew|P35555|FBN1 JWMAN 



FIBRILLIN 1 PRECURSOR.//:swiss|P35555|FBN1_HUMAN FIBRILLIN 1 
r^ R r CUR cOR7/:trembl|L13923|HSFIBRLLNJ product: "fibrillin"; Homo sapiens fibrillin 
mRNA, complete cds. //:gp|Ll3923|306746 product: Tibrillin"; Homo sapiens fibrillin mRNA, 
complete cds. 

0 . 18 XGGAS.CHNTLGSYKCMCPAGFQYEQFSGGC0D1NECGSAQAPCSYGCSNTEGGYLCGCPP SEQ ID NO: 236 

-- — G^GASCHNTLGS-Y-KCMCPAGFQYEQFSGGCQDlNECGSAQAPCSYGCSNTEGGyLCGCPP SEQ ID NO: 237 
H: 2617~ CGGASCHNTL ~ GSY ^^ SE Q ID NO: 238 

GYFR3GQGHCVSGMGMGRGNPEPPVSGEMDDNSLSPEACYECK1NGYPKRGRKRRSTNET SEQ ID NO: 236 
GYFR1G0GHCVSGMGMGRGNPEPPVSGEMDDNSLSPEACYECK1NGYPKRGRKRRSTNET SEQ ID NO: 237 
GYFR1GQGHCVSGMGMGRGNPEPPVSGEMDDNSLSPEACYECKINGYPKRGRKRRSTNETSEQ ID NO: 238 

DASNIEDQSETEANVSLASWDVEKTA1FAFNISHV-NKVR1L 178 SEQ ID NO: 236 

DASNIEDQSETEANVSLASWDVEKTAIFAFNISHV NKVR1L SE q X D NO: 237 

DASNIEDQSETEANVSLASWDVEKTAIFAFNISHVSNKVRIL 2778 SE q id NO: 238 
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Alignment: N656- trembl|AF088916|AF088916J 



gene: "EMn product: "elastin microfibril interfase located protein"; Homo sapiens elastin 
microfibril interfase located protein (EMI) gene, complete cds. 

//:trembl|AF 08891 6)AF088916_1 product: "emilin precursor"; Homo sapiens emilin precursor, 
mRNA, complete cds and 3' UTR. //:gp|AF 08891 6J5353510 product: "emilin precursor"; 
Homo sapiens emilin precursor, mRNA, complete cds and 3 1 UTR, 
//:gpnew|AFl62780|6693840 gene: "EMI"; product: "elastin microfibril interfase located 
protein"; Homo sapiens elastin microfibril interfase located protein (EMI) gene, complete cds. 

q. 7 DGDVYNPSTGVFTAPYDGRYLI TATLTPERDAYVEAVLSVSNASVAQLHTAGYRREFLEY SEQ ID NO: 239 

DG . . Y : P . TGVFTAP . GRYL: :A.LT .R. . VEAVLS SN . . VA: : . : .GY . E LE SEQ ID NO: 240 
H: 896 DGGYYDPETGVFTAPLAGRYLLSAVLTGHRHEKVEAVLSRSNQGVARVDSGGYEPEGLE-sEQ ID N q. 241 

HRPPGALHTCGGP-GAFHLIVHLKAGDAV 94 SEQ ID NO: 239 

::P :.• G. G.F.LI: L: AGD.V : SEQ ID NO: 240 

NKPVAESQPSPGTLGVFSLILPLQAGDTV 983 SEQ ID NO: 241 



gene: "EMI"; product: "elastin microfibril interfase located protein 1 '; Homo sapiens elastin 
microfibril interfase located protein (EMI) gene, complete cds. 

//:trembl|AF0889l6|AF0889l6jl product: "emilin precursor"; Homo sapiens emilin precursor, 
mRNA, complete cds and 3' UTR. //:gp|AF088916|53535l0. product: "emilin precursor"; 
Homo sapiens emilin precursor, mRNA, complete cds and 3' UTR. 
//:gpnew|AFl62780|6693840 gene: "EMI"; product: "elastin microfibril interfase located 
protein"; Homo sapiens elastin microfibril interfase located protein (EMI) gene, complete cds. 

q. 7 DGDVYNPSTGVFTAPYDGRYL1TATLTFERDAYVEAVLSVSNASVAQLHTAGYRREFLEY SEQ ID NO: 242 

DG. .Y: P. TGVFTAP . GRYL: :A.LT .R. . VEAVLS SN . . VA: : . : .GY . E LE SEQ ID NO: 243 
H: 896 DGGYYDPETGVFTAPLAGRYLLSAVLTGHRHEKVEAVLSRSNQGVARVDSGGYEPEGLEN SEQ ID NO: 244 

HRPPGALHTCGGPGAFHLIVHLKAGDAV 94 SEQ ID NO: 242 

. : : G G.F.LI: L: AGD.V SEQ ID NO: 243 

KPVAESQPSPGTLGVFSLILPLOAGDTV 983 SEQ ID NO: 244 
V 
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Alignment: No59 - pironly|A35763|A35763 



unnamed ORF; P.lividus 2-alpha collagen (COLL2-alpha) mRNA, complete cds. 
// pironly|A35763|A35763 collagen alpha 2 chain - sea urchin (Paracentrolus lividus) 
(fragment)//^Pl^ 054 22|1 58962 unnamed .QBF; P.lividus 2-alpha collagen (CQLL2-alpha).y , . . ... 
mRNA, cornpfete cds: ~ T^--'^^:~^Z^^- ^.ri'lliriz^:: ' _ 

0 . 92 GENGSSGSQAPLQGLRG1FGLWGRRSRARFCGPR-PVARLGGGTSAGRELGL 142 SEQ ID NO: 245 

GE G.SG...P QG:RGI G: G. . . : . GPR P . GGG S G . . GL SEQ ID NO: 246 

H . 71 8 GEPGPSGENGP-QGVRGIPGWGENGKTGRGGPRGPPGLRGGGGSRGERGGL "768 SEQ ID NO: 247 



unnamed ORF; P.lividus 2-alpha collagen (COLL2-alpha) mRNA, complete cds. 
// pironly|A35763|A35763 collagen alpha 2 chain - sea urchin (Paracentrolus lividus) 
(fragment)//:9P|J05422|1 59962 unnamed ORF; P.lividus 2-alpha collagen (COLL2-alpha) 
mRNA, complete cds. 



92 GENGSSGSQAPLQGLRGIFGLWGRRSRARFCGPR-PVARLGGGTSAGRELGL 142 SEQ ID NO: 248 

GE G.SG...P QG:BG1 G: G. . . : . GPR P . GGG S.G.. GL SE q ID N0 . 249 

71 8 GEPGPSGENGP-QGVRG1PGVVGENGKTGRGGPRGPPGLRGGGGSRGERGGL 7 68 g E q ID NQ . 2 5Q 
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Alignment: No60/63 - swiss|P20062|TCO2^HUMAN 



TRANSCOBALAMIN II PRECURSOR.//:trembl|M60396|HSTCIM gene: ,, TCN2 W ; product: 
"transcobalamin II"; Human transcobalamin II (TCII) mRNA, complete cds. 
V/:gp|M60396|339l96 gene: "TCN2"; product: "transcobalamin II"; Human transcobalamin II 

jri^ll)-rnRNA; completers.'- ' :: '\^~:^^Zl. ^ 

q. 8 VEPFHOGHHSVDTAAMAGIAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPE'CTFGN SEQ ID NO: 251 

VEPFHQGHHSVDTAAHAGLAFTCLKRSNFNPGRRQR1TMA1RTVREE1LKAQTPEGHFGN SEQ ID NO: 252 
H: 183 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHFGN SE q ID NQ ! 2 53 

VYSTPLALOFLMTSPMPGAELGTACLKARVALLASLQDGAFQKALMISQLLPVLNHKTYI SEQ ID NO: 251 
VySTPLALOFLMTSPNPGAELGTACLKARVALLASLODGAFQNALMlSOLLPVLNHKTYlSEQ ID NO: 252 
VYSTPLALQFLMTSPMPGAELGTACLKARVALLASLQDGAFQNALMlSOLLPVLNHKTYIsEQ ID NO: 253 

DLIFPDCLAPRVMLEPAA 145 SEQ ID NO: 251 

DLIFPDCLAPRVMLEPAA SE q Ir) NQ ! 2 52 

DLIFPDCLAPRVMLEPAA 320 ^ ^ ^ ^ 



TRANSCOBALAMIN II PRECURSOR,//:trembl|M60396|HSTCIM gene: "TCN2 n ; product: 
"transcobalamin II"; Human transcobalamin II (TCII) mRNA, complete cds. 
//:gp|M60396|339196 gene: "TCN2"; product: "transcobalamin IP; Human transcobalamin II 
(TCII) mRNA, complete cds. 

q. 8 VEP FHQGHHSV DT AAMAGLAFTCLKRSN FNPGRRQR1 TMA ' Al SEQ ID NO: 254 

VEPFH0GHHSVDTAAMAGLAFTCLKRSNFNPGRRQR1TMA SE q id N q! 255 

H: 183 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQR3TMA 222 S EQ ID NO* 256 



TRANSCOBALAMIN II PRECURSOR.//:trembl|M60396|HSTCIM gene: "TCN2";. product: 
"transcobalamin IP; Human transcobalamin II (TCII) mRNA, complete cds. 
//:gp|M60396J339l96 gene: M TCN2 M ; product: "transcobalamin II"; Human transcobalamin II 
(TCII) mRNA, complete cds. 

q. 8 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQR1TMA 41 S EQ ID NO* 257 

VEP FHQGH H SV DT AAMAGLAFTCLKRSN FN PGRRQRl TMA SEQ ID NO* 258 

H: 183 VEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRR0R1TMA 222 S£ g ID * ^ 
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Alignment: No65 - swissnew|P23142|FBL1_HUMAN 



_ ^ nnrn mrnn //..,.,:^in'j'7000iCDl n UllMAkl trim 11 l^nPHRM H 

PRECURSORJ/:trembl|U01244|HS2444_1 product: "fibulin-1D"; Homo sapiens f»bulin-1D 
mRNA, complete cds. //:gp|U01244|1621019 product: Tibulin-ID"; Homo sapiens fibutin-1D 
mRNA, complete cds. 



18 KNCQD3 DECVTG 3 HNCS 3 NETCFN 3 QGGFRCLAFECPENYRRSAAT1/QQEKTDTVRC3 KS ' SEQ ID NO: 260 

RMCOD3DECVTG2KNCS2NETCFN3QG.FRCLAFECPENYRRSAATLQQEKTDTVRC1KS SEQ ID NO: 261 

521 RNCQD3DECVTG1HNCS2NETCFN3QGAFRCLAFECPENYRRSAATLQQEKTDTVRCIKS SEQ ID NO: 262 

CRPNDVTCVFDPVHT3 SHTV3 SLPTFREFTRPEE3 3 FLRA3 TPPHPASQAN3 3FD3TEGN SEQ ID NO: 26Q 

CRPNDVTCVFDPVHT3 SHTV3 SLPTFREFTRPEE3 3 FLRA3TPPHPASQANI 3 FD3 TEGN SEQ ID NO: 261 

CRPNDVTCVFDPVHT3 SHTV3 SLPTFREFTRPEE3 3 FLRA3TPPHPASQANI 3 FD2TEGN SEQ ID NO: 262 

LRDSFD3 3 KRYMDGMTVG3 RR 158 SEQ ID NO: 260 

LRDSFD2 3 KRYMDGMTVG : R SEQ ID NO: 261 

LRDSFD3 3 KRYMDGMTVGVVR 661 SEQ ID NO: 262 
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Alignment: 80 - trembl|AF045447|AF045447_1 



g?rS^PC4"; product: "deleted in pancreatic carcinoma"; Homo sapiens deleted in 
pancreatic carcinoma (DPC4) gene, exon 11 partial sequence and complete cds. 
//•trembl|U44378|HS443781 _1 gene: "DPC4"; product: M Dpc4 M ; Human homozygous deletion 
target in pancreatic carcinoma (DPC4) mRNA, complete cds. //:pironly|S7l811|S7l811 
probable transcription regulator MAD-4 - human//:gp|AF045447|2865657 gene: "DPC4"; . 
product: "deleted in pancreatic carcinoma"; Homo sapiens deleted in pancreatic carcinoma 
(DPC4) gene, exon 11 partial sequence and complete cds. //:gplU44378|1163234 gene: 
"DPC4"; product: "Dpc4"; Human homozygous deletion target in pancreatic carcinoma 
(DPC4) mRNA, complete cds. 



6 PGSRlRGRVDTLOXNAFXXMMVKDEyVHDFEGQPXLXTEGHXlQTlQHPPXNRAXTETYX SEQ ID NO: 263 

PG :.G TLQ.NAP. .MMVKDEYVHDFEGQP .L.TEGH . 1QTIQHPP .NRA. TETY . SEQ ID NO: 264 

139 PGI DLSGL — TLQSNAPSSMMVKDEYVHDFEGQPSLSTEGHS1QTIQHPPSNRASTETYS- SEQ ID NO: 265 

TPALLAPXEXNATXTANFPN1PVAXTXQPAX1LGGXHXEGLLQIAXGPQPGOQQNGFTGQ SEQ ID NO: 263 

TPALLAP . E . NAT . TANFPN1 PVA . T . QPA . 1 LGG . H . EGLLQI A . GPQPGQQQNGFTGQ, SEQ ID NO: 264 

TPALLAPSESNATSTANFPNIPVASTSQPASILGGSHSEGLLQIASGPQPGQQQNGFTGQ SEQ ID NO: 265 

PATYHHNXTTTWTGXRTAPYTPNLPHHQKG 155 SEQ ID NO: 263 

PAT YHHN . TTTWTG . RTAPYTPNLPHHQ . G SEQ ID NO: 264 

PATYHHNSTTTWTGSRTAPYTPNLPHHQNG 286 SEQ ID NO: 265 
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Alignment: N086 - trembl|D32210|D32210J 



gene; "Notch2"; product: "cell surface protein"; Mus muscuius (Notch2) mRNA, compieie cds. 
//:gp|D322l0|2373395 gene: "Notch2"; product: "cell surface protein"; Mus muscuius 
(Notch2) mRNA, complete cds. 



81 MPALRPALLWALLALWLCCATPAHALQCRDGYEPCVKEGMCVTYHNGTGYCKCP-GFLGE SEQ ID NO: 266 
MP LRPA.L ALL LWLC A PAHALQCR . G . EPCVNEG . CVTYHNGTG : C : CP" GFLGE SEQ ID NO: 267 
1 MPDLRPAALRALLV5LWLCGAGPAHALQCRGGQEPCVNEGTCVTYHNGTGFCRCPEGFLGE SEQ ID NO: 268 

YCQHR-PCEKNRCGDPSTC 15*7 SEQ ID NO: 266 

YCOHR PCEKNRC : .TC SEQ ID NO: 267 

YCQHRDPCEKNRCQNGGTC IS SEQ ID NO: 268 
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j^j^^ GGCCGCGTCGACGACCAAACCTGTGATAACCI GinCAAAACCCTCClCATCATCT ACICCIICG'TCT ^ SEQ ID NO: 109 
| adaptor 1 

] R GRV . 6 DOTCOM LS ONPP HH-LLLRL SEQ ID NO: 269< 
j ' , i . . i i 1 i > i ■ * i ■ 



"f CTGGAT C AC1GGGGTGATCC1GC1 GGC1G*J7 GGAG1C1GGGGCAAACT1 AC1C1GGGCACCT Al ATCTCCCTTA 

. = . 1 S I "■ I ~i ,1 ■ 1 = I 1 .,, . | 



160 



in HWGDPAGCWSLGOTYSGHLYLPY 

■ ' ' ■ ' ' ' ' ' ' ' ' ' L 

nGCCGAGAAClCCACAAATGCTCCOAlGlGnCATCGGAAClGGCACCACTATTGTTGlCTTTGGCCTGTYTG 
._ _j 1,, i i i, i 1 ■ . . i ■ ■ ' i , . i . » 



225 



rRELHKCSLCAHRNWHHYCCLWPVW 
, i i 1 ■ 1 i » i 

GAIGCTH GCTACA7G1CG7GG1AGCCCA7 GGA1GC1GAAAC1G1 ATGCCA'TGT'nCf GTCCC7 GG7GTTCC1GG 
i- » • ' ■ ' ■ » ' 1 ' 1 ' L 300 



nL CYMSW.PflDA ETVCHVSVPGVPG 



CIGAGC TCGTAGCTGGCAm 

ARSWHFRVCVSS .DOGHLPEDLHG 
. i . 1 . i 1 . • 1 ■ ■ ■ ■ ■* 



375 



ACGC1 AlGCAGACTT ACAA7GGCAA7 GAlGAGAGGAGCCGGGCAGlGGACCAIGTGCAGCGCAGCCfGAGCTGCT 

• I ■ 1 ■ 1 ■ ^ ' » — - : ' ' L iJ50 

o yADLOWO. . EEPGSGPCAAOPELL 

. ^ 1 . > . , I . , I I . 1 i 1 J 



G1GG1G1GCAGAACTACACCAAC1 GGAGCACCAGCCCCTACtTCCTGGAGCATGGC 

t .* — , — ■ i ■ i . i 1 ■ i i * ' ■ i i . . — 



V/CAELHOLEHQPLLPGAW 

— — i i i i i i i ■ ■ i , . . i 



506 
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AAnCGCGGCCGCGTCGACCAAGlGCAACACCCTCCAClGIGCCiniGGACCAGC ACCAACAGGAMGUKCC SEQ ID NO: 110 

^ , i » 1 • 1 1 1 ■ 75 

y adaptor | 

j rg RVD0V0HPPLCLLD0H 0OEC ] P SEQ ID NO: 270 
H— k 1 ' ' ' ' ' L — ' ' ' ' ' ' * 

1CCG1GCCT CCCACCGGACCACC1CCAGGACCCCCAGCACCCTT1CCTCCT1CCGGACCATCA7GTCCCCCACCT ^ 



P 

i 



ri PPDHL ODPOHPFLtPDHHVPHl 

■ . i . i 



GGT GG1 C C T T A T C C A G C C C C A A C 1 G TG C C G G G C C C T G G CC gC A£A G G GG^M A- ^-C-l^ C AC C A A Al^AT-G G CXTT T =-. 
, "j1 — ; — » ■ r " i — » ' — • ■ — 1 1 — *• 225 



v v L 1 O P 0LCRALAP0GH1LH01CPF 
, i i i i — » * ■ ■ 1 ■ i ■ . ■ i i ■ ■ 

CCAGAGCT ACCCANACCAUTGG7GCACCCACAGAKCAGC1GCAGCTGGUCTnAGGTCCATGGGGA"rCCATG 



300 



0SYP7HMVHP010LQLVL.VHGDPC 
. ■ ■ ■ 1 ' 1 ' ' ~~ 1 ' 1 ' L 



375 



1 QT1CTGG ACC77 GGGCGCCAGGAATGGGAGGGCAGT ATCC7 ACCCCT AATATGCCATA1CCATC1CCAGGCCCA 

i iDLGROEWEGS I LPL I CH I H LOAH 

_ -* ■ ■ • • ' ■ ■ 1 ' ' ' ' 

T ATCCCGNTCCTCCTCCTCCCCAAGCCCCTGGGGCAGCACCACCTGnCCAf GGGGCACCGTTCCACCAGGAGCC 
' Alt : u i ■ - j » • ' ■ »• 450 

»P7LLLPKPLGOHHLFHGAPFHOEP 
1 , , i , i . >..,., 1 i ■ i 

TGGGGAC ^CAGCACCATATCCIGCCCCTACAGGAICGTATCCCACACCAGGACTCTAKCUCTCCC 
6 D H fl H H 1 LPLOOR IPHODSILLP 
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AATUGC6GCCGCG1CGAC1GCG6AG6A6CCTCC16KACAACACCCTG66GAGCTACAAG16CA1GT6TCCC6C SEQ ID NO: 111 

I ; , • - - - — »— — — »- 1 ■ ■ ■ ■ ■ i ... i . i i ■ i . .i i 75 

I adaptor 1 



1 r G RVOC GGASCHNTLGSYKCHCPA SEQ ID NO: 271 

, » i . i . ■ . i . i . t , 



CGGC7 T CC AG7 A7GAACAGT7 CAG7 GGAGGA7GCCAAGAC A7 CAA7 GAA7 G7GGC7 C7 GCGC AGGCCCCCTGC AG 
u 1 ■ y ■ . , . , . . . , l 15Q 



G F 0 Y E ^° ZI1A G &~ t~ Q^-Q ~J N E C G S A — Q — A*- -p — G — S - 



C7A7GGCTGT7CCAA7ACCGAGGGCGG77ACC7 G1G1GGC1G1CCACC1 GGT1 ACTTCCGCAl AGGCCAAGGGCA 
^ ■ • » . 1 1 ' ■ > ■ 1 * 225 



Y6CSNTEGGYLCGCPPGYFRIG.0GH 



C1G1GTTTCTGGAATGGGCATGGGCCGAGGAAACCCAGAGCCACCTGTCAGTGGTGAAATGGATGACAATTCACT 

— — i_ — ' — 1 1 • 1 ' 1 1 » — 1 ■ 1 ■ ' 300 



CVSGMGMGRGNPEPPVSGEHODNSL 
i — ■ « — — — « — « » 1 1 ■ 1 ■ l 



C7CCCCAGAGGC7 7G77 ACGAG7 G7AAGA7 CAA7GGC7ACCCCAAACGGGGCAGGAAACGGAGAAGCACAAACGA 
' ■ 1 s ' ■ " ■ ' - » • « . 375 



SPEACYECKl NGYPKRGRKRRSTNE 

. 1 ■ - ■ • * ■ ■ ■ i ■ » - * ■ » * » . . 



AAC7GA7GCC7CCAA7 A7CGAGGA7CAG7C7GAGACAGAAGCCAA7G7 GAG7CT7GCAAGT7GGGA7G77GAGAA 

. -> 1 ' ' ' 1 1 ' 1 1 1 1 — : — ■ H50 



T0ASNIED0SE7EANVSLAS W D V E K 

1 ' ■ » ■ I L^M I , . I 



GACAGCCA7CTT7GC7 77C AA7 A777CCCACG7CAGTAACAAGG77CGAA7CC7ANAACTCCTT 
^ . ■ . , 1 ■ . , . , . ► 51(J 

; » 

JAlPAFN 1SHVSNKVR IL7LL 

■ . i — ' ' 1 ■ 1 ' " > 
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AATTCGCGGCCGCGKGACGGGGAlGnUCAACCCCAGCACCGGGGTCTTCACGGCKCllAlGATGGGCGCTA SEQ ID NO: ii2 

j --. . 1 ■ « -—-J ' 1 ' 1 ' 75 

j adaptor | 

IRGRVDGDVYNPSTGVFTAPYDGRY SEQ ID NO: 272 
I , . . — . — ■ i. i ■ i ' » • » - 

C CI GAT CACGGCCACCCTCACCCCCGA6AGAGACGCC1 ACG1GGAAGCAGTGCTG1CGGTCTCCAACGCC AGCAG ^ 



L 1 T__A T L T PERDAYV E A VLSVSNASS 

, ■ ~~ ! ■ " ■ ■ "" ■ ; » . ■ 1 ' " ,- a- i .1 . l 

1GGCCCAGC7 6CAT ACCGC1GGGT ACAGGAGAGAGUCCIGGAAT ACCACCGCCClCCAGGAGCntGCAT ACCT 

, » ■ ■ ■ J 1 1 ■ 1 b 225 



GPAAYRWVQERVPGIPPPSRSFAYL 



GCGGGGGCCCGGGGGCAT7CCACCTCA7CGTGCACCTGAAGGCGGGAGA7GCAGTCAACGTCG7GG1GAC7GGGG 
, , i i . i i i i * i i .i 



300 



r G p G G 1 P PHRAPEGGRCSORRGDWG 

r ■ - ■ - » . I 1 ■ I ■ . ' ' ' ■ ■ ■ I 

GCAAGC1 GGC1CACACAGACTT1 GAT GAAATG1 ACT CC ACA7 IT AGTGGGGTTTTCTT A7 ATCCT1TCCTT1CCC 

. > 



375 



OAGSHRL . N V L H 1 .WGFL1SFPFP 



ACCTCTAAGGTGGC1GGG6AGATGT 

— _i_ . ■ ■ ► 1)00 

PLRWLGRC 
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AMICGCGGCCG CG^CGACAAAGAAAAAAAGAAAG'y'IllCACIClGGGCIGT GGAAC7A777C AGGACnCC7<A SEQ ID NO: 113 

^ — ^ . i . i . ' ' ' "~ 75 



j adaptor ~1 



1 R G R V D , K E , K K K V F , T 1 G , C G T 1 S G L . P ^ SEQ ID NO: 273 

G GG6TllCCTCTGGAGCTTCClGAG7TlCCKCl6GACATTTTGKTCCAGGTCCCAGCGCCAGGCA6GGGTfiQr 
, . . . , , , . , ■ ■ i ■ -i. 15Q 



GFP lelpefppghfvsrsoroagn a 

J T "» 1 ' 1 ^ — -.!_ ■ . .. l. — — * 



UCCGGAAGGGCTG7GGGlGCCACCC1GGC7GAC1GCAGCCC7C7CnGCACC7CCTCCCGGCCAlCCACCCGCA 
^ ^ . . , . , . , . . ' 1 » ^ 225 

pg r AVGATLADCSPLLHLLPAI HP q 

^ | 1 ■ ! 1 ... I > ■ 1 t , ■ , 1- , « , 

GGAGGTCTTCCCCCAGCACTGGmGlGAGGAGCrCCCTCTGCCCGGGAGAAAAT GGCUn CCGGQlCACAGGC 
— " ' ' ' ' 1 1 ' " ' ■> 300 



EVf rPOHWLVRS$LCPGE N G S $ G S Q * 



■jCCCC7CCAGGGAC7GAGGGGCA77777GGA77G7GGGGAAGGCGC7CCAGGGCCCGG77C7G7GGCCCCAGGCC 
, ^ , , - , , , . ■ , fc 



P L 



qGLRGIFGLWGRRSRARFCGP r 



7G77GC7CGGC7GGG7GGAGGCACC7C7GCAGGCCGGGAGC77GG7C777GAACACC7 



433 



4 



va rLGGG7 SAGRELGL.7P 
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AA77CGCGGCCGCG7CGACGGGGA7G777 ACAACCCCAGCACCGGGG7C77CACGGC7CC77A7GA7GGGCGCTA SEQ ID NO: 114 

I J ■ ' ' ■ ' ' ' ' ' ' " 75 



| adaptor | 



1 rg RVD GDVYNPSTGV FTAPYDGRY SEQ ID NO: 274 

I , -i — • ^-i . 1 . 1 u ■ ■ ' 1 * 

CC7GA7CACGGCCACCC7 CACCCCCGAGAGAGACGCC7ACG7GGAAGCAG7GC7G7CGG7C7CCAACGCCAGCAG 
i ~* ' ■ ' ■ • ' ■ » ' ' ■ l 150 



L I f A T L T P ~ ~E~~~ = R "O A" Y y "fT" A V ~L~ S~ V S N "A S S 
i . * - * « i i . . « . * - 



IGGCCCAGCl^CAl ACCGC1GGG1 ACAGGAGAGAGTTCCtGGAAT ACCACCGCCCTCCAGGAGC7T1GCATACCT 
_ . ■ . 1 . , . . . 1 . i ^ 225 



gpaayrwvoervpgipppsrsfayl 

■■ — — i ■ « • 1 ' • 1 » ». 



gcgggggcccgggggcattccacctcaicgtgcaccigaaggcgggagaigcagtcaacgtcgiggigacigggg 

■ — : — 1 • 1 ■ h 1 • ■ ■ ■ ■ 1 ' 1 L 300 



RGPGG I PPHRAPEGGRCSORRG OWG 

t ^ 1 . 1 ■ i I ■ i i i i I ■ 1 i i . 1 ■ ■ i I 

GCAAGC7GGC7CACACAGAC777 GA7 GAAA7G7 AC7CC ACA7 77 AG7 GGGG7777C7T AT A7CCT77CC777CCC 
_ . . . , . , . , 1 . 1 . . . 375 



QAGSH RL . N V L H 1 .WGFL1SFPFP 

i. — ■ ■ » ■ ■>■■■■■ . ■ I ' ' 1 i I ... I I 



ACCTC7AAGG7GGC7GGGGAGA7GT 
1— . ■ 1 ■ > 400 

plrwlgrc 
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AAnrcCGGCC GC67CGACCGCAAC1 G1CAAGACA71GA1GAG1GTG1GA.C1GGCA1CCACAAC1GC1 CCATCAA SEQ ID NO: 115 

An 1 _ , , , u , , , , , . | , , , , ». 75 



, R G RVDRNCODIDECVTGl-HNCSl N SEQ ID NO: 275 

^ , ^ > ' ■ 1 ^ 1 1 1 1 1 

rr AT AC CI GCT7CAACAKCAGGGCGGCT1CCGC7GCC1 GGCCTTCGAGTGCCCl GAGAAC7 ACCGCCGCTCCGC 
LbA . » — — ' • 1 ■ ' » ■ ■ • — —J- 150 

E T C F N OGGFRCLAFECPE N Y F ^ R S A 

art TACGCl CCAGCAGGAGAAGACAGACACGG1 CCGC1GCA7 CAAGTCCTGCCGCCCCAACGATGTCACA7GCGT 
m , ^ • 1 . » ^— — i ■ « ■ > 225 



7 L OOEKTDT V R C IKSCRPNDVTCV 



GnCGACCCCGTGCACACCATCTCCCACACCGTCATCKGCTGCClACClKCGCGAGlTCACCCGCCCTGAAGA 

i — - 1 1 ■ ' 1 1 1 J — ' ' L 300 



cnpVHT ISHTV 1SLPTFR EFTRPEE 
, ^ » . i i i , i . 1 , ■ . — i ■ i 

CAifA7CTTCCTCCGGGCCAlCACGCCACCGCA7CC7GCCAGCCAGGCiAACA7CA7CT1CGACATCACGGAAGG 
bAILA ^ ' ■ 1 ■ » • ' ■ > ■ »— - 375 

j | f L R A 1TPP HPAS 0AN1 1 FD 1 TEG 



GAACC1 GCGGGACTCTTTTGACATCATCAAGCGTT ACA1 GGACGGCAlGACCGIGGGTG'f CGTGCGCCAGG1GCG 



450 



NL RDSFDI 1KRYMDGMTVGVVR0VR 
r ■ i — ■ » 1 • i 1 — — ■ i.i 1 , ■ i 

GCCCAKGfEG GCCCA'nTCAl GCCG1CCT GAAGCTGGAGATGAACT A1G1GGTCG 



piVG PFHAVLKLEMNYVV 

u — ■ » i ■ — — 1 , i ■ i ■ i 
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AAfTCGCGGCCGCGTCGACACACTGCAGAGf AATGC1CCATCAAG1 A1GA1GGTGAAGGA1GAA1 A1GTGCATGA SEQ ID NO: 116 

\ ' ' 4 1 1 — — 1 1 L — 1 ' ! - 75 



| adaptor 1 



]RG RVDTLOSNAPSSMMVKOEYVHD SEQ ID NO: 276 

I i » 1 1 ■■ 1 . 1 i i ■ 1 i 

mTCAKGGACAGCCAlCGTrGTCCACTGAAGGACAnCAAnCAAACCATCCAGCATCCACCAAGTAAlCGlGC 

I — 1 ' ■ • ' — — > ■ ' > * 150. 

PEG 0_ P S L S T E G H S 1 0 T 1 J) H P P SNRA 

AlCGACAGAGACATACAGCACCCCAGCTClGTTAGCCCCAlCTGAGTCTAATGCTACCAGCACf GCCAACTTTCC- 
^ l . > . . . . . . . . ■ . 225 

stetystpallapsesnatstanfp 



CAACAT1CC1 G1GGC7TCCACAAGTCAGCC1GCCAG1 Al ACT GGGGGGCAGCCAT AGf GAAGGAC1 GUGCAGAT 
, — ■ — ' ' ■ 1 • 1 ■ ' ■ ' " »- 300 



NIPVASTSOPASILGGSHSEGLLOI 

! 1 » I I i I 1 1 ■ ■ I I 

AGCATCAGGGCCTCAGCCAGGACAGCAGCAGAAIGGATTl ACT GGTCAGCCAGCTAC71 ACCATCAT AACAGCAC 
• ■ ' ■ " ■ ' . 1 ' i 1 ^ 375 



asgpopgqqongf TGOPATYHHNST 



1 ACC ACCT GGAC7GGAAGT AGGACTGCACCAT ACACACCT AAITTGCCTCACCACCAAAA 
-i ■ » ■ > ■ ■ ■ 1 i ■ > 435 



ttwtgsrtapytpnlphhok 

I — I — ■ . ■ I I I I I ■ ■ I ■ I ■ ■ — 
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AATirGCGGCCGCGlCGACCGGCGGCCGAGGAGCGGCGGAClCCGGGCGCGGGGAGTCGAGGCATTTGCGCCTGG SEQ ID NO: 117 

j 1 , t- 1 1 h 1 . | 1 1 »— ' 1 ► 75, 

T1AAGCGCCGGCGCAGCTGGCCGCCGGC1CCTCGCCGCCTGAGGCCCGCGCCCCTCAGCKCGTAAACGCGGACC SEQ ID NO: 118 

j adaptor "| 

IR G RVDRRPRSGGLRARGVEAFAPG SEQ ID NO: 277 

. i . i . t . i i i i • i i i 1 1 i 

I 

rmfGGAGCGTAGCGCCAGGGCClGAGCCnTGAAGCAGGAGGAGGGGAGGAGAGAGTGGGGCTCCUTAlCGG 

I - I ; i ■ — H I ■ 1 i 1 ' 1 1 h 150 

CGAAGCCT CGCA7CGCGG7CCCGGACTCGGAAACTKGTCCTCCTCCCCTCCTCTCTCACCCCGAGGAGA1 AGCC 



I R S VAPGPEPLKOEEGR RE WGSS 1 G 

. — ; . i . i . i 1 1 > — » l 

rArrrrCTCCCCATGTGGATCTGCCCAGGCGGCGGCGGCGGAGGAGGCGACCGAGAAGATGCCCGCCCTGCGCCC 

, — f- ' — I 1 1 • 1 « 1 — I 1 H ► 225 

C1GGGGGAGGGG1 ACACC1 AGACGGGTCCGCCGCCGCCGCCTCCTCCGCTGGCTCTTCTACGGGCGGGACGCGGG 



TP S.PCGS.AOAAAAEEATEKMPALRP 



rrrTCTGCTGTGGGCGCTGCT GGCGC1C1GGCTGTGC1GCGCGACCCCCGCGCATGCATTGCAGTGTCGAGA1GG 

I 1 ' 1 ' I ' 1 H 1 1 • 1- 300 

GCGAGACGACACCCGCGACGACCGCGAGACCGACACGACGCGC1GGGGGCGCGT ACGTAACGTCACAGCTCTACC 



AL LWALLALWLCCATPAHALOCRDG 

, I ■ I I ■ I 1 ■ , > ■ . 1 ■ ■ . I I I ■ I I I II I 1 

run CAACCCTGTGT AAAlGAAGGAAT G1G1GT1 ACCT ACCACAATGGCAC AGGAT AC1GCAAATGTCCANAAGG 

, H —I 1 \ ~h 1 »— H ' I >— I ► 375 

GA7 ACT! GGGACACATT7 ACTICCTTACACACAATGGAIGGIGTTACCGTGTCCTATGACGTTTACAGG'TNTTCC 



YE pCVNEGMCVTYHNGTGYCKCP?G 

' X ■ » . I . I 1 1 » L 1 1 ■ i 



mcnGGGGGAAT ATI GTC AACA7CGANACCCCTG7GAGAAGAACCGCTGC 

til |.- ) ■ I i 1 ■ I 427 

GAAGAACCCCCTT ATAACAGTTGTAGCTNTGGGGACACTCTTCTTGGCGACG 

. " ► 

fL gEYCOHR?PCEKNRC 
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Pathogenicity 
Island Probe # 
(bp#) 


P. aeruginosa srains that gave 
Positive signal with the specific 
probe used 


P. aeruginosa srains that 
gave Negative signal with 
the specific probe used 


3 (25562-26456) 


PA14,CF2,CF6,CF26,CF29 


PA01,PAK,CF1,CF3, 
CF4,CF5,CF27,CF28, 
CF30, CF32 


4 (61181-o3oU/j 


rA14, Lr2, v_ro, L-rzo, Lrzy 


PA01,PAK,CF1,CF3, 
LF4, Cro, CF27, CF28, 
CF30, CF32 


5 (74933-76117) 


PAH, PA037, CF2, CF6, CF26 


PA01,PAK,CF1,CF3, 
CF4,CF5,CF27,CF28, 
CF29, CF30, CF32 


6 (84922-86622) 


PAOl, PA14, PA037, CF2, CF, 
CF26, 


PAK,CF1,CF3,CF4,CF5, 
CF27, CF28, CF29, CF30, 
CF32 


7 (103070-104556) 


PA14, PA037, CF2, CF6, CF26 


PA01,PAK,CF1,CF3, 
CF4, CF5, CF27, CF28, 
CF29, CF30, CF32 


8 (104799-105545) 


PA 14, CF2, CF6, CF26 


PA01,PAK,CF1,CF3, 
CF4, CF5, CF27, CF28, 
CF29, CF30, CF32 



These experiments indicate that at least part of the big island region contained in each 
probe is present in the P. aeruginosa strains that gave positive hybridization signal. 
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ORF7 Proteir* SEQ ID NO: 278 



minshllyrlsyrgtsffqpwtlpvlldsrlrgapfygcaracqpsdpksfssfstsdktalplhaaalsrlpdahekap 
pkrgfpcpppkrsgeddlvafhlrrdtgtrrefagqdqlrqrvldpaldgplqracaidrveadgnqlvqrllaqfqaql 
algqalaqateldlgdagdllasqrlehhhfvdpvdefrtevridrvhhcgtlrlavagqlldlrrtevgghhhhgvaev 
hrtpvtvgqasvlehleenveyirmgllhlvqqhhrvglaadrlgqvaafleadvarrradqaghrvflhelghiyphqr 
llg3eeelgqrlaqlglahpgraeeeeraarpvrigeagartahgvghgdyrlvladhspmqlllhaqqlla1alehlrh 
rdtgplgnhfgdflvghlvaqqlvlglavlvdhlqaafqvrdglvldarhalevalaprrlhlllglldllldlrralhl 
gllglpdllevgv falelddillqlgqalpggfvvfllqrlaldlqldqatvetiqflrlgvdlhadaagglvdqvdglv 

RQL PIG D V AVPvQL G RG D DRA VG DAH P VV H F I AFLEATE DG DG V FLAR FV H QHLLE AALQRG I LLDVLAI LV EG S S T D AVQ 

LAARQSRLEHVAGVHGTFRLAGADHGVQFVDEQDDPAFLLAQFVEDRLQAFLELAAELGTGDQRPHVQGQQALVLEAVRH 

FAVDDALGQALDDGGLADAGFADQHRVVLGPPLQDLDGPADLWATDHRVELAFLGALGHVDGVLVQRLARLLDVRVVHR 

FAATQVGHGILQRL^ RHALAE QQLAEPGVLVHRGQQYQLAGDELVALLLGQAVSLVEQACEILGQVHVAGRALD 

FFVEAAAQGGDIEADLHQQGLDRTALLLEQGGKQVHRLDGRMVMANGQGLGVGERQLQLAGQTVYSHGSSFLL. 
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ORF7 nucleotide sequence SEQ ID NO: 119 



ATGATTAACAGTCATTTGCTCTACCGACTGAGCTATCGCGGAACGTCTTTCTTCCAACCCTGGACGCTTCCGGTGTTGCT 

GGATTCGCGTCTCAGAGGCGCGCCATTTTACGGATGCGCGCGGGCATGTCAACCCTCTGATCCAAAAAGTTTTTCTTCTT 

TTTCCACGAGCGACAAAACGGCCCTTCCACTGCATGCGGCAGCGCTCTCGCGCCTACCGGACGCCCATGAAAAA<5CCCCG 

CCGAAGCGGGGCTTTCCCTGTCCGCCCCCGAAGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTCGCCGAGATAC 

TGGCACCCGGCGCGAATTTGCCGGCCAGGATCAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGCCCGCTTCAGC 

GGGCGTGCGCCATAGACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAGTTCCAGGCTCAGCTC 

GCGCTCGGCCAGGCGCTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCGGCGATCTGCTCGCGAGCCAGCGGCTCGA 

ACACCACCACTTCGTCGATCCGGTTGATGAATTCCGGACGGMGTGCGCATTGACCGCGTCCATCACTGCGGCACGTTGC 

GCCTCGCGGTCGCCGGCCAGCTCCTGGATCTGCGCCGAACCGAGGTTGGAGGTCATCACCACCACGGTGTTGCGGAAGTC 

CACCGTACGCCCGTGACTGTCGGTCAGGCGTCCGTCCTCGAGCACCTGGAGGAGAATGTTGAATACATCCGGATGGGCCT 

TCTCCACCTCGTCCAGCAGCACCACCGAGTAGGGCTTGCGGCGGATCGCCTCGGTCAGGTAGCCGCCTTCCTCGAAGCCG 

ACGTAGCCCGGAGGCGCGCCGATCAGGCGGGCCACCGAGTGTTTCTCCATGAACTCGGACATA TCTAT CCGCACCAGCGC 

CTCCTCGGTATCGAAGAGGAACTCGGCCAG '" ~ 

CGCCTTGCACAACTCGGTCTTGCCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATCGGCGA 
GGCCGGCGCGCGAACGGCGCACGGCGTTGGACACGGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCAGCTCC 
TGCTCCATGCGCAGCAGCTTCTCGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACCACTTC 
GGCGATTTCCTCGTCGGTCACCTTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAGGCTGC 

TGCTTGGCCTGCTCGATCTTCTGCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCCTCGAG 
GTCGGCGTATTCGCGCTCGAGCTTGACGATATCCTCCTCCAGCTTGGCCAGGCGCTTCCTGGTGGCTTCGTCGTCTTCCT 
TCTTCAGCGCCTCGCGCTCGATCTTCAGCTGGATCAGGCGACGGTCGAGACGATCCAGTTCCTCCGGCTTGGAGTCGATC 
TCCATGCGGATGCGGCTGGCGGCCTCGTCGATCAGGTCGATGGCCTTGTCCGGCAGTTGCCGATCGGTGATGTAGCGGTG 
CGACAGCTTGGCCGCGGCGATGATCGCGCCGTCGGTGATGCTCACCCCGTGGTGCACTTCATAGCGTTCCTTGAGGCCAC 
GGAGGATGGCGATGGTGTCTTCCTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTCCAGCGCGGCATCCTTC 
TCGATGTACTGGCGATACTCGTCGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCAGAGCCGGCTTGAGCAT 
GTTGCCGGCGTCCATGGCACCTTCCGCCTT 

GCCGGCGCCGACCATGGTGTGCAGTTCGTCGATGAACAGGATGACCCGGCCTTCCTGCTTGCCCAGTTCGTTGAGGACCG 

CCTTCAGGCGTTCCTCGAACTCGCCGCGGAACTTGGCACCGGCGATCAGCGCCCCCATGTCCAGGGCCAGCAGGCGCTTG 

TCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGATGCGCTGGGCCAGGCCCTCGACGATGGCGGTCTTGCCGACGCCGGG 

TTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGCCGCTGCAGGACCTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGA 

TCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTTGGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCG 

ACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCAGGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCC 

GAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCCATCGCGGCCAGCAATACCAGCTCGCTGGAGATGAACTGGTCGCCCT 

TCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAGGCGTGCGAGATCCTGGGACAGGTTCACGTCGCCGGTCGGGCTCTGG 

ATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGCCGCTGCGCAGGGCGGCGATATCGAAGCCGACCTGCATCAGCAGGGG 

CTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCGGAAAGCAGGTGCACCGGCTCGATGGCCGGATGGTCATGGCCAACGG 

CCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAGCTTGCTGGTCAAACGGTCTATTCGCATGGGTCGTCCTTCCTTCTAT 

AG 
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clpB protein SEQ ID NO: 279 



MRIDRLTSKLQLALSDAQSLAVGHDHPAIEPVHLLSALLEQQGGSIKPLLHQVGFDIAALRSGLNKELDAL'PKIQSPTGD 
VNLSQDLARLLNQADRLAQQKGDQF3 SSELVLLAAMDENTRLGKLLLGQGVSRKALENAVANLRGGEAVNDPKVEESRQA 
LDKYTVDMTKRAEEGKLDPV1GRDDEIRRT3QVLQRRTKNNPVL1GEPGVGKTA1VEGLAQRI1NGEVPDGLKDKRLLAL 
DMGAL3 AGAKFRGEFEERLKAVLNELGKQEGRVI LF3 DELHTMVGAGKAEGAMDAGNMLKPALARGELHCVGATTLDEY.R 
QYIEKDAALERRFQKVLVDEPSEEDT3 A1LRGLKERYEVHHGVSITDGA3I AAAKLSHRY1TDRQLPDKA1 DL1DEAASR 
3RMEIDSKPEELDRLDRRL1QLKIEREALKKEDDEATRKRLAKLEED1VKLEREYADLEE1WKSEKAEVQGSAQIQQKIE 
QAKOEMEAARRKGDLESMAR3 QYQT-3 PDLERSLQMVDQHGKTENQLLRNKVTDEE3 AEWSKWTG1 PVSKMLEGEREKLL 
RMEQELHRRV1GQDEAVVAVSNAVRRSRAGLADPNRPSGSFLFLGPTGVGKTELCKALAEFLFDTEEALVRIDMSEFMEK 
HSVARL1GAPPGYVGFEEGGYLTEA3RRKPYSWLLDEVEKAHPDVFNILLQVLEDGRLTDSHGRTVDFRNTVWMTSNL 
GSAQ1 QELAGDREAQRAAVMDAVNAHFRPEF3 NR3 DEVWFEPLAREQ3 AG 3 AE3 QLGRLRKRLAERELSLELSQEALDK 
L1AVGFDPVYGARP3.KRA3QRW3ENPLAQL3LAGKFAPGAS3SAKVEGDE3VFA, 
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clpB DMA 1 , SEQ ID NO: 120 



ATGCGAATAGACCGTTTGACCAGCAAGCTGCAACTGGCGCTCTCCGACGCCCAGTCCCTGGCCGTTGGCCATGACCATCC 

GGCCATCGAGCCGGTGCACCTGCTTTCCGCCCTGCTCGAGCAGCAAGGCGGTTCGATCAAGCCCCTGCTGATGCAGGTCG 

GCTTCGATATCGCCGCCCTGCGCAGCGGCCTCAACAAAGAACTCGACGCGCTGCCGAAGATCCAGAGCCCGACCGGCGAC 

GTGAACCTGTCCCAGGATCTCGCACGCCTGCTCAACCAGGCTGACCGCCTGGCCCAGCAGAAGGGCGACCAGTTCATCTC 

CAGCGAGCTGGTATTGCTGGCCGCGATGGACGAGAACACCAGGCTCGGCAAGCTGCTGCTCGGCCAGGGCGTGTCGCGCA 

AGGCGCTGGAGAATGCCGTGGCCAACCTGCGTGGCGGCGAAGCGGTGAACGACCCGAACGTCGAGGAGTCGCGCCAGGCG 

CTGGACAAGTACACCGTCGACATGACCAAGCGCGCCGAGGAAGGCAAGCTCGACCCGGTGATCGGTCGCGACGACGAGAT 

CCGCCGGACCATCCAGGTCCTGCAGCGGCGGACCAAGAACAACCCGGTGCTGATCGGCGAACCCGGCGTCGGCAAGACCG 

CCATCGKGAGGGCCTGGCCCAGCGCATCATCAACGGCGAAGTGCCGGACGGCCTCAAGGACAAGCGCCTGGTGGCCGTG 



C^^CAGGAAGGCCGGGTCATCCTGTTCATCGACGT^ACTGCACACCATGGTCGGCGCCGGCAAGGCGGAAGGTGCCATGG 
ACGCCGGCAACATGCTCAAGCCGGCTCTGGCGCGCGGCGAGCTGCACTGCGTCGGTGCTACTACCCTCGACGAGTATCGC 
CAGTACATCGAGAAGGATGCCGCGCTGGAG 

CGCCGCTTCCAGAAGGTGCTGGTGGACGAACCGAGCGAGGAAGACACCATCGCCATCCTCCGTGGCCTCAAGGAACGCTA 

TGAAGTGCACCACGGGGTGAGCATCACCGACGGCGCGATCATCGCCGCGGCCAAGCTGTCGCACCGCTACATCACCGATC 

GGCAACTGCCGGACAAGGCCATCGACCTGATCGACGAGGCCGCCAGCCGCATCCGCATGGAGATCGACTCCAAGCCGGAG 

GAACTGGATCGTCTCGACCGTCGCCTGATCCAGCTGAAGATCGAGCGCGAGGCGCTGAAGAAGGAAGACGACGAAGCCAC 

CAGGAAGCGCCTGGCCAAGCTGGAGGAGGATATCGTCAAGCTCGAGCGCGAATACGCCGACCTCGAGGAGATCTGGAAGT 

CCGAGAAGGCCGAGGTGCAGGGCTCGGCGCAGATCCAGCAGAAGATCGAGCAGGCCAAGCAGGAGATGGAGGCGGCGCGG 

CGCAAGGGCGACCTCGAGAGCATGGCGCGCATCCAGTACCAGACCATCCCGGACCTGGAACGCAGCCTGCAGATGGTCGA 

CCAGCACGGCAAGACCGAGAACCAGTTGCTGCGCAACAAGGTGACCGACGAGGAAATCGCCGAAGTGGTTTCCAAGTGGA 

CCGGTATCCCGGTGTCGAAGATGCTCGAGGGCGAGCGCGAGAAGCTGCTGCGCATGGAGCAGGAGCTGCATCGGCGAGTG 

ATCGGCCAGGACGAGGCGGTAGTCGCCGTGTCCAACGCCGTGCGCCGTTCGCGCGCCGGCCTCGCCGATCCGAACCGGCC 

GAGCGGCTCGTTCCTCTTCCTCGGCCCGACCGGGGTGGGCAAGACCGAGTTGTGCAAGGCGCTGGCCGAGTTCCTCTTCG 

ATACCGAGGAGGCGCTGGTGCGGATAGATATGTCCGAGTTCATGGAGAAACACTCGGTGGCCCGCCTGATCGGCGCGCCT 

CCGGGCTACGTCGGCTTCGAGGAAGGCGGC 

TACCTGACCGAGGCGATCCGCCGCAAGCCCTACTCGGTGGTGCTGCTGGACGAGGTGGAGAAGGCCCATCCGGATGTATT 

CAACATTCTCCTCCAGGTGCTCGAGGACGGACGCCTGACCGACAGTCACGGGCGTACGGTGGACTTCCGCAACACCGTGG 

TGGTGATGACCTCCAACCTCGGTTCGGCGCAGATCCAGGAGCTGGCCGGCGACCGCGAGGCGCAACGTGCCGCAGTGATG 

GACGCGGTCAATGCGCACTTCCGTCCGGAATTCATCAACCGGATCGACGAAGTGGTGGTGTTCGAGCCGCTGGCTCGCGA 

GCAGATCGCCGGCATCGCCGAGATCCAGCTCGGTCGCCTGCGCAAGCGCCTGGCCGAGCGCGAGCTGAGCCTGGAACTGA 

GCCAGGAGGCGCTGGACAAGCTGATTGCCGTCGGCTTCGACCCGGTCTATGGCGCACGCCCGCTGAAGCGGGCCATCCAG 

CGCTGGATCGAGAACCCGCTGGCGCAACTGATCCTGGCCGGCAAATTCGCGCCGGGTGCCAGTATCTCGGCGAAGGTGGA 

AGGCGACGAGATCGTCTTCGCCTGA 
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A. PAPI-1 
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A. PAPI-1 



5,086,881 
5,106,035 5,087,000 5,086,925 



5,086,881 5,086,925 5,056,091 
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B. PAPI-2 

105,790 106,510 106,833 



tRNA-Lys 



1,069,543 1,069,669 1,088,331 



H44 
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FIGURE 63 
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Positions 



Length (bp) Number of identical bp 



Genes between the repeats 
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IS name PAPI-1 PAP1-2 Characteristics oflS 

Position Length (bp) Position Length (bp) Original length (bp) IS family 

ISPpuH 56,778-59,119 2341 7,034-7,999 966 2^83 1S66 

1S222 - - 2,980-4,201 1,222 1,232 IS3 
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FIGURE 67 

ORF 7 (SEQID NO: 280) 

L^FGSATOTRTRDP 

GFPCPPPKRSGEDDLVAFHLRRDTGTRREFAGQDQLRQRVLDP 
5 LASQRLEHHHFVDPVDEFRTEWIDRVHHCGTLRLAVAGQLLDLRRTEVGGHHHHGVAEVHRTPVTVGQASVLEHLEENVEYIRMGLLHLVQQHHRV 
GLAADRLGQVAAFLEADVARRRADQAGHRVFLHELGHIYPHQRLLGIEEELGQRLAQLGLAHPGRAEEEERAARPVRIGEAGARTAHGVGHGDYRLV 
LADHSPMQLLLHAQQLLALALEHLRHRDTGPLGNHFGDFLVGHLVAQQLVLGLAVLVDHLQAAFQVRDGLVLDARHALEVALAPRRLHLLLGLLDLL 
LDLRRALHLGLLGLPDLLEVGVFALELDDILLQLGQALPGGFWFLLQRLALDLQLDQATVETIQFLRLGVDLHADAAGGLVDQVDGLVRQLPIGDV 
AVRQLGRGDDRAVGDAHPVWFIAFLEATEDGDGVFLARFVHQHLLEAALQRGILLDVLAILVEGSSTDAVQL^RQSRLEHVAGVHGTFRLAGADH 
10 G VQ F VD EQDD P AFL L AQ F VEDRLQ AF L E L AA E LGTGDQR PH VQGQQAL VL E A VRH F A VDDALGQALDDGGLADAG F ADQHR WLGP PLQDLDG P ADL 
WATDHRVELAFLGALGHVDGVLVQRLARLLDVRWHRFAATQVGHGILQRLARHALAEQQLAEPGVLVHRGQQYQLAGDELVALLLGQAVSLVEQA 
CEILGQVHVAGRALDLRQRVEFFVEAAAQGGDIEADLHQQGLDRTALLLEQGGKQVHRLDGRMVl^GQGLGVGERQLQLAGQTVYSHGSSFLL 

15 ORF7 <SEQ ID NO: 281) 

TTGGAATTTGGCTCCGCGACCTGGACTCGAACCAGGGACCCA ATGATTAACAGTCATTTGCTCTACCGACTGAGCTATCGCGGAACGTCTTTCTTCC 
AACCCTGGACGCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCATTTTACGGATGCGCGCGGGCATGTCAACCCTCTGATCCAAAAAGTTT 
TTCTTCTTTTTCCACGAGCGACAAAACGGCCCTTCCACTGCATGCGGCAGCGCTCTCGCGCCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGG 
GGCTTTCCCTGTCCGCCCCCGAAGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTCGCCGAGATACTGGCACCCGGCGCGAATTTGCCGGCC 

20 AGGATCAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGCCCGCTTCAGCGGGCGTGCGCCATAGACCGGGTCGAAGCCGACGGCAATCAGCT 
TGTCCAGCGCCTCCTGGCTCAGTTCCAGGCTCAGCTCGCGCTCGGCCAGGCGCTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCGGCGATCTG 
CTCGCGAGCCAGCGGCTCGAACACCACCACTTCGTCGATCCGGTTGATGAATTCCGGACGGAAGTGCGCATTGACCGCGTCCATCACTGCGGCACGT 
TGCGCCTCGCGGTCGCCGGCCAGCTCCTGGATCTGCGCCGAACCGAGGTTGGAGGTCATCACCACCACGGTGTTGCGGAAGTCCACCGTACGCCCGT 
GACTGTCGGTCAGGCGTCCGTCCTCGAGCACCTGGAGGAGAATGTTGAATACATCCGGATGGGCCTTCTCCACCTCGTCCAGCAGCACCACCGAGTA 

25 GGGCTTGCGGCGGATCGCCTCGGTCAGGTAGCCGCCTTCCTCGAAGCCGACGTAGCCCGGAGGCGCGCCGATCAGGCGGGCCACCGAGTGTTTCTCC 
ATGAACTCGGACATATCTATCCGCACCAGCGCCTCCTCGGTATCGAAGAGGAACTCGGCCAGCGCCTTGCACAACTCGGTCTTGCCCACCCCGGTCG 
GGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCACGGCGTTGGACACGGCGACTACCGCCTCGTC 
CTGGCCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGCTTCTCGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTG 
GAAACCACTTCGGCGATTTCCTCGTCGGTCACCTTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAGGCTGCGTTCCA 

30 GGTCCGGGATGGTCTGGTACTGGATGCGCGCCATGCTCTCGAGGTCGCCCTTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGCTCGATCTTCTG 
CTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCCTCGAGGTCGGCGTATTCGCGCTCGAGCTTGACGATATCCTCCTCC 
AGCTTGGCCAGGCGCTTCCTGGTGGCTTCGTCGTCTTCCTTCTTCAGCGCCTCGCGCTCGATCTTCAGCTGGATCAGGCGACGGTCGAGACGATCCA 
GTTCCTCCGGCTTGGAGTCGATCTCCATGCGGATGCGGCTGGCGGCCTCGTCGATCAGGTCGATGGCCTTGTCCGGCAGTTGCCGATCGGTGATGTA 
GCGGTGCGACAGCTTGGCCGCGGCGATGATCGCGCCGTCGGTGATGCTCACCCCGTGGTGCACTTCATAGCGTTCCTTGAGGCCACGGAGGATGGCG 

35 ATGGTGTCTTCCTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTCCAGCGCGGCATCCTTCTCGATGTACTGGCGATACTCGTCGAGGG 
TAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCAGAGCCGGCTTGAGCATGTTGCCGGCGTCCATGGCACCTTCCGCCTTGCCGGCGCCGACCAT 
GGTGTGCAGTTCGTCGATGAACAGGATGACCCGGCCTTCCTGCTTGCCCAGTTCGTTGAGGACCGCCTTCAGGCGTTCCTCGAACTCGCCGCGGAAC 
TTGGCACCGGCGATCAGCGCCCCCATGTCCAGGGCCAGCAGGCGCTTGTCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGATGCGCTGGGCCAGGC 
CCTCGACGATGGCGGTCTTGCCGACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGCCGCTGCAGGACCTGGATGGTCCGGCGGATCTC 

40 GTCGTCGCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTTGGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACG 
TTCGGGTCGTTCACCGCTTCGCCGCCACGCAGGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCCGAGCAGCAGCTTGCCGAGCC 
TGGTGTTCTCGTCCATCGCGGCCAGCAATACCAGCTCGCTGGAGATGAACTGGTCGCCCTTCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAGGCG 
TGCGAGATCCTGGGACAGGTTCACGTCGCCGGTCGGGCTCTGGATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGCCGCTGCGCAGGGCGGCGATA 
TCGAAGCCGACCTGCATCAGCAGGGGCTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCGGAAAGCAGGTGCACCGGCTCGATGGCCGGATGGTCAT 

45 GGCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAGCTTGCTGGTCAAACGGTCTATTCGCATGGGTCGTCCTTCCTTCTATAG 
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FIGURE 68 (SEQ ID NO: 121) 



ACGTCGGGGGCGCA ^TGc tA| C GCCTG|cAgAAT| GGTTTCAGGGCCTTAGAAACAGAAAAGCCCACC jfa^|^ 

||ggcgggctattccatattga |cM 

gaaccggcgtaacaccccttcctagcctatgcaactcgccccgtagaaaa tggtgggtcgtgtaggattc 
gaacctacgaccaattggttaaaagccaactgctctaccgactgagctaacgaccca agtatgagg tggt 
cggggtagagagattcgaactcccgacatcctgctcccaaagcaggcgcgctaccggactgcgctatacc 
ccgattggaatttggctccgcgacctggactcgaaccagggacccaatg 



Title: VIRULENCE-ASSOCIATED NUCLEIC ACIDS AND 
PROTEINS AND USES THEREOF 
Applicants: Laurence Rahme et al. 

Filing Date: September 12, 2003 Serial No.: Not Yet Assigned 
Page 116 of 118 Customer No.: 21559 

FIGURE 69 
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FIGURE 7 OA 

CLUSTAL W (1.82) multiple sequence alignment 



CF2 (SEQ ID NO: 122) G ATGAAQG - ACCCGAGCGGAACATCCATCTCAC 32 

PA037 (SEQ ID NO: 123) GATGAAGG- ACCCGAGCGGAACATCCATCTCAC 32 

CF6 (SEQ ID NO: 124) - GATGAAGGCACCCGAGCGGAACATCCACCTCAC 33 

PA14 (SEQ ID NO: 125) TTTCCGGCACACCCTGGCCACCGACTTGATGAAGGCACCCGAGCGGAACATTCACCTCAC 60 

CF26 (SEQ ID NO: 126) GATGAAGG -ACCCGAGCGGAACATCCACCTCAC 32 

******** *************** ** ***** 

CF2 G AAGTGC CTGCT C AA CC ACTCG AAT ATC CAG ACC ACCATG AGCT AC AT CG AGGCCG ACT A 92 

PA03 7 G AAGTGC CTGCT C AAC C ACT C G AAT ATC CAG ACC ACCATG AGCT AC AT CG AGGCCG ACTA 92 

CF6 GAAGTGCCTGCTCAACCACTCGAATATCCAGACCACCATGAGCTACATCGAGGCCGACTA 93 

PA14 G AAGTGC CTGCT C AA CC ACT CG AAT AT CC AG AC CA CGATG AGCT AC AT CG AGGCCG ACTA 120 

CF2 6 G AAG TGC CTGCTC AACC A CT CG AAT ATC CAG ACC ACGATG AGCT A C AT CG AGG CCGACTA 92 



CF2 CGACCACATGCGTGCCGTGCTGCATGCCAGAAGCCTGGCCCAAGGAGCGCTGGAGAACGT 152 

PA03 7 CGACCACATGCGTGCCGTGCTGCATGCCAGAAGCCTGGCCCAAGGAGCGCTGGAGAACGT 152 

CF6 CGACCACATGCGTGCCGTGCTGCATGCCAGAAGCCTGGCCCAAGGAGCGCTGGAGAACGT 153 

PA14 CGATCACATGCGTGCCGTGCTGCATGCTAGAAGCCTGGCCCAAGGCGCGCTGGAGAATGT 180 

CF2 6 CGATCACATGCGTGCCGTGCTGCATGCTAGAAGCCTGGCCCAAGGCGCGCTGGAGAATGT 152 
*** *********************** ***************** *********** ** 

CF2 CAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCATGCGGGCA 2 12 

PA03 7 CAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCATGCGGGCA 2 12 

CF6 CAGGAAGGTGGATTACAGCGGCTCCCCGCAGGCCTCTGCCAAACCGAAGCCATGCGGGCA 213 

PA14 CAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCATGCGGGCA 24 0 

CF2 6 C AGG AAGGTGGATT AC AGCGG CTC CCCG C AAGCCTCTGCCAAACCGAAACC ATGCGGGCA 212 
****************************** ***************** *********** 

CF2 ACCTCTCGCTCGAATGGGTG AAGTACCGCCGCCGGAGGCCAGGACAGAACCTG CAG AACC 272 

PA03 7 ACCTCTCGCTCGAATGGGTGAAGTACCGCCGCCGGAGGCCAGGACAGAACCTGCAGAACC 272 

CF6 ACCTCTCGCTCGAATGGGTG AAG T AC CGCCGC CGG AGGCC AGG AC AG AACCTGCAG AACC 273 

PA1 4 ACCTCTCGCTCGAGTGAGTGAAGCGCCGCCACCGGAGGCCAGGACAGAGCCTGCAGAACC 300 

CF26 ACCTCTCGCTCGAGTGAGTGAAGCGCCGCCACCGGAAGCCAGGACAGAGCCTGCAGAACC 272 



CF2 AAGGGAGCACATACCAGGGACAGGCATTCAGGGAGGTCCAACCGTGCGGGAAGAAG C 329 

PA037 AAGGG AG C ACAT A CC AGGG A C AGGC ATT C AGGG AGGTCCAACCGTGCGGG AAGAAG C 329 

CF6 AAGGGAG C ACAC A CC AGGG A C AGGC ATT C AGGG AGGTCCAACCGTGCGGG AAG AAG - - - C 330 

PA1 4 AAGGG AGCACACGCCAGGGACAGGCATTCAGGGAGGTCCAACCGCGTGGGAAGCAGATGC 360 

CF26 AAGGGAGCACACACCAGGGAC AGGCATTCAGGGAGGTCCAACCGAGTGGGAAGCAGAAGC 332 
*********** ******************************* * ****** ** * 

CF2 GCTACCACAGCCACCTGACACCTTCGACCAAAGCGTGCTGTTCACTCTGATGGCTCAACA 389 

PA03 7 GCTACCACAGCCACCTGACACCTTCGACCAAAGCGTGCTGTTCACTCTGATGGCTCAACA 389 

CF6 GCTACCACAGCCACCTGACACCTTCGATCAAAGCGTGCTGTTCACTCTGATGGCTCAACA 390 

PA14 GCTACCACAGCCACCTGACACCTTCGAACCAAGCGTGCTGTTCACTCTGATGGCTCAAAA 420 

CF2 6 GCTACCACAGCCACCTGACACCTTCGAGCAAAGCGTGCTGTTCACTCTGATGGCTCAACA 392 
*************************** * **************************** * 

CF2 CTTATCGAACCGTGCCGCCTCGGCATCCGCGGCTCCCGCTGCAACAAGCGGATCTGGTGG 44 9 

PA03 7 CTTATCGAACCGTGCCGCCTCGGCATCCGCGGCTCCCGCTGCAACAAGCGGATCTGGTGG 449 

CF6 CTTATCG AACCGTGCCGC CT CGGC ATCCG CAGCT CCCGCTG CAACAAGCGG ATCTGGTGG 4 50 

PA1 4 CTTATCG AACCGTGC CGC CT CGGC AT C CGCGGCT CCCGCTGC AACAAGCGG ATC AGGCGG 4 80 

CF2 6 CTTATCGAACCGTGCCGCCACGACATCTGCGGCTCCCGCCGCAACCAGCGGATCTTGTAG 452 
******************* ** **** ** ******** ***** ******** * * 

CF2 ATGGGGATCTACTGCCCG AAGCAGTCTCGCCTAGCGATACCGATACTGAAGGGCCGGCTA 509 

PA03 7 ATGGGGATCTACTGCCCG AAGCAGTCTCGCCTAGCGATACCGATACTGAAGGGCCGGCTA 509 

CF6 ATGGGGATCTACCGCCCG AAGCAGTCTCGCCTAGCGATACCGGTACTGAAGGGCCGGCTA 510 

PA1 4 ATGGGGATCTGCCGCCCG AAGCAATCTCGCCTAGCGATACCGGTACTGA - GGGCCGGCTA 539 

CF26 ATGGGGATCTGCCGCCCGAAGCAGCCTCGCCTAGCGATACCGGTACTGAGGGGCCGGCTA 512 
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CF2 CCGGACGAAAGGTAGCCGCGCCTCCCAGCAGTTCGCTAGGCCTGTAAGAAAAATCTGGAA 569 

PA03 7 CCGGACGAAAGGTAGCCGCGCCTCCCAGCAGTTCGCTAGGCCTGTAAGAAAAATCTGGAA 569 

CF6 CCGGACG AAAGGTAGC CGCGCCT C C CAGC AGT T CG CT AGGC C TGT AGG AAAAAT CTGG AA 570 

PA1 4 CCGGACG AAAGGTAGC CG TG CCTTC CAG C AG AT CGTT AGGC C TGT AGG AAAAAT CTGG AA 599 

CF26 C CAG ACGAAAGGTAG CCG CG CCT CCCAG CAGAT CGC TGGGC C TGT AGG AAAAATCTGG AA 572 

** *************** **** ******* *** * ******** ************* 

CF2 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGAGCCC - CGCAGTTTCACGG 628 

PA03 7 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGAGCCC - CGCAGTTTCACGG 628 

CF6 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGGGCCC - CGCAATTTCAAGG 62 9 

PA14 TTACCGAGAGCGCCTGGATTCCAGCGCCGGCATGCTGGCAGAGCCAGCGCAATTTCAAGG 659 

CF2 6 T T AC CG AG AGCG CCTGG ATT CC AGCGCCGGC ATGC TGGCAG AGC C C - CGCAATTTCACGG 631 

***************************************** *** **** ***** ** 

CF2 CCAAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCGCATTGCTACGCCTGC 688 

PA03 7 CCAAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCGCATTGCTACGCCTGC 688 

CF6 C - GAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGAGGGCACATTGCTACGCCTGC 688 

PA14 CCAATACCACAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCGCATTGCTACGCCTGC 719 

CF26 C - AAAACCGCAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCACATTGCTACGCCTGC 690 

* * *** ****************************** **** *************** 

CF2 AGAA - TGGTTTCAGGGCCTTANAAACAGAAAAGCCCACCTTAAATAGGCGGGCTATT - CC 746 

PA037 AGAAATGGTTTCAGGGCCTTAGAAACAGAAAAGCCCACCCTAAATAGGCGGGCTATT-CC 74 7 

CF6 AGAA - TGGTTTCAGAGCCT - GAAAACAGAAAAGNCCACC - TAAATAGGCGGGCTATTTCC 74 5 

PA14 AGAA - TGGT0TCAGGGCCTTAGAAACAGAAAAGCCCACC - TAGAAAGGCGGGCTATT - CC 776 

CF26 AGAA - TGGTTTCAGAGCCTTANAAACAGAAAAGCCCACC - TAGATAGGCGGGCTATT - CC 74 7 

**** ********* **** *********** ***** ** * ************ ** 

CF2 ATATT - GACATCACG - TCAATGCGGG - - CCTAATGTTC - -GGCCCAGACGGCTG - -CTGG 798 

PA03 7 ATATT - GACATCACG - TCAATGCGGG - - CCTAATGTTC - - GGCCCANACGGCTG - - CTGG 799 

CF6 ATATTTGA CATC CCG - TCAATGCGGGGCCCTAATGGTTCGGGCCCANACGGCTTGCTTGG 804 

PA1 4 ATATT - GACATCACG - TCAATGCGGG - - CCTAATGTTC - - GGCCCAGACGGCTG - - CTAG 828 

CF26 ATATT - GAC ATCACGGTCAATGCGGG - - GCTAATGTTC - GGGCCCANACGGNTG CAA 800 

***** ****** ** ********** ****** * ****** **** * 



